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HBM VARIANTS THAT MODULATE BONE MASS AND LIPID LEVELS 

FIELD OF TH E I NVENT I ON 
The present invention relates generally to the field of genetics, genomics and 
5 molecular biology. The invention relates to methods and materials used to isolate, detect and 
sequence a high bone mass gene and corresponding wild-type gene, and mutants thereof. The 
present invention also relates to the high bone mass (HBM) gene, the corresponding wild- 
type gene, and mutants thereof. The genes identified in the present invention are implicated 
in the ontology and physiology of bone development. The invention also provides nucleic 

10 acids, proteins, cloning vectors, expression vectors, transformed hosts, methods of 

developing pharmaceutical compositions, methods of identifying molecules involved in bone 
development, and methods of diagnosing and treating diseases involved in bone development 
and lipid levels. The invention further relates to transgenic animals for studying the HBM 
phenotype and related variant phenotypes, the mechanism of action of the HBM gene and its 

15 variants, and factors and treatments affecting normal and abnormal bone conditions. In 

preferred embodiments, the present invention is directed to methods for treating, diagnosing, 
preventing and screening for normal and abnormal conditions of bone, including metabolic 
bone diseases such as osteoporosis. 

20 BACKGR OUND OF TH E I NVENT I ON 

Two of the most common types of osteoporosis are postmenopausal and senile 
osteoporosis. Osteoporosis affects men as well as women, and, taken with other 
abnormalities of bone, presents an ever-increasing health risk for an aging population. The 
most common type of osteoporosis is that associated with menopause. Most women lose 

25 between 20-60% of the bone mass in the trabecular compartment of the bone within 3-6 years 
after the cessation of menses. This rapid loss is generally associated with an increase of bone 
resorption and formation. However, the resorptive cycle is more dominant and the result is a 
net loss of bone mass. Osteoporosis is a common and serious disease among postmenopausal 
women. There are an estimated 25 million women in the United States alone who are 

30 afflicted with this disease. The results of osteoporosis are both personally harmful, and also 
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account for a large economic loss due to its chronicity and the need for extensive and long- 
term support (hospitalization and nursing home care) from the disease sequelae. This is 
especially true in more elderly patients. Additionally, while osteoporosis is generally not 
thought of as a life-threatening condition, a 20-30% mortality rate is related to hip fractures in 
elderly women. A large percentage of this mortality rate can be directly associated with 
postmenopausal osteoporosis. The costs alone associated with the treatment of osteoporotic 
fractures in the United States is $10 to $15 billion annually. Worldwide incidence of 
osteoporotic hip fractures is estimated to exceed 1.7 million cases. 

The most vulnerable tissue in the bone to the effects of postmenopausal osteoporosis 
is the trabecular bone. This tissue is often referred to as spongy bone and is particularly 
concentrated near the ends of the bone near the joints and in the vertebrae of the spine. The 
trabecular tissue is characterized by small structures which inter-connect with each other as 
well as the more solid and dense cortical tissue which makes up the outer surface and central 
shaft of the bone. This crisscross network of trabeculae gives lateral support to the outer 
cortical structure and is critical to the bio-mechanical strength of the overall structure. In 
postmenopausal osteoporosis, it is primarily the net resorption and loss of the trabeculae 
which lead to the failure and fracture of the bone. In light of the loss of the trabeculae in 
postmenopausal women, it is not surprising that the most common fractures are those 
associated with bones which are highly dependent on trabecular support, e.g., the vertebrae, 
the neck of the femur, and the forearm. Indeed, hip fracture, Colle's fractures, and vertebral 
crush fractures are indicative of postmenopausal osteoporosis. 

One of the earliest generally accepted methods for treatment of postmenopausal 
osteoporosis was estrogen replacement therapy. Although this therapy frequently is 
successful, patient compliance is low, primarily due to the undesirable side-effects of chronic 
estrogen treatment. Frequently cited side-effects of estrogen replacement therapy include 
reinitiation of menses, bloating, depression, and fear of breast or uterine cancer. In order to 
limit the known threat of uterine cancer in those women who have not undergone a 
hysterectomy, a protocol of estrogen and progestin cyclic therapy is often employed. This 
protocol is similar to that which is used in birth control regimens, and often is not tolerated 
by women because of the side-effects characteristic of progestin. More recently, certain 
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antiestrogens, originally developed for the treatment of breast cancer, have been shown in 
experimental models of postmenopausal osteoporosis to be efficacious. Among these agents 
is raloxifene (See, U.S. Patent No. 5,393,763, and Black et al, J. Clin. Invest, 93:63-69 
(1994)). In addition, tamoxifene, a widely used clinical agent for the treatment of breast 
5 cancer, has been shown to increase bone mineral density in post menopausal women 
suffering from breast cancer (Love et al, N. Engl. J. Med., 326:852-856 (1992)). 

Another therapy for the treatment of postmenopausal osteoporosis is the use of 
calcitonin. Calcitonin is a naturally occurring peptide which inhibits bone resorption and has 
been approved for this use in many countries (Overgaard et al, Br. Med. J., 305:556-561 

10 (1992)). The use of calcitonin has been somewhat limited, however. Its effects are very 
modest in increasing bone mineral density and the treatment is very expensive. Another 
therapy for the treatment of postmenopausal osteoporosis is the use of bis-phosphonates. 
These compounds were originally developed for use in Paget's disease and malignant 
hypercalcemia. They have been shown to inhibit bone resorption. Alendronate, one 

1 5 compound of this class, has been approved for the treatment of postmenopausal osteoporosis. 
These agents may be helpful in the treatment of osteoporosis, but these agents also have 
potential liabilities which include osteomalacia, extremely long half-life in bone (greater than 
2 years), and possible "frozen bone syndrome, 11 e.g., the cessation of normal bone 
remodeling. 

20 Senile osteoporosis is similar to postmenopausal osteoporosis in that it is marked by 

the loss of bone mineral density and resulting increase in fracture rate, morbidity, and 
associated mortality. Generally, it occurs in later life, i.e., after 70 years of age. Historically, 
senile osteoporosis has been more common in females, but with the advent of a more elderly 
male population, this disease is becoming a major factor in the health of both sexes. It is not 

25 clear what, if any, role hormones such as testosterone or estrogen have in this disease, and its 
etiology remains obscure. Treatment of this disease has not been very satisfactory. Hormone 
therapy, estrogen in women and testosterone in men, has shown equivocal results; calcitonin 
and bis-phosphonates may be of some utility. 

The peak mass of the skeleton at maturity is largely under genetic control. Twin 

30 studies have shown that the variance in bone mass between adult monozygotic twins is 
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smaller than between dizygotic twins (Slemenda et al, J. Bone Miner. Res., 6:561-567 (1991); 
Young et al, J. Bone Miner. Res., 6:561-567 (1995); Pocock et al, J. Clin. Invest, 80:706-710 
(1987); Kelly et al, J. Bone Miner. Res., 8:11-17 (1993)), and it has been estimated that up to 
60% or more of the variance in skeletal mass is inherited (Krall et al, J. Bone Miner. Res., 
10:S367 (1993)). Peak skeletal mass is the most powerful determinant of bone mass in 
elderly years (Hui et al, Ann. hit. Med., 1 1 1 :355-361 (1989)), even though the rate of age- 
related bone loss in adult and later life is also a strong determinant (Hui et al, Osteoporosis 
Int. , 1 :30-34 (1 995)). Since bone mass is the principal measurable determinant of fracture 
risk, the inherited peak skeletal mass achieved at maturity is an important determinant of an 
individual's risk of fracture later in life. Thus, study of the genetic basis of bone mass is of 
considerable interest in the etiology of fractures due to osteoporosis. 

Recently, a strong interest in the genetic control of peak bone mass has developed in 
the field of osteoporosis. The interest has focused mainly on candidate genes with suitable 
polymorphisms to test for association with variation in bone mass within the normal range, or 
has focused on examination of genes and gene loci associated with low bone mass in the 
range found in patients with osteoporosis. The vitamin D receptor locus (VDR) (Morrison et 
al, Nature, 367:284-287 (1994)), PTH gene (Howard et al, J. Clin. Endocrinol. Metab., 
80:2800-2805 (1995); Johnson et al, J. Bone Miner. Res., 8:11-17 (1995); Gong et al, J. Bone 
Miner. Res., 10:S462 (1995)) and the estrogen receptor gene (Hosoi et al, J. Bone Miner. 
Res., 10:S170 (1995); Morrison et al, Nature, 367:284-287 (1994)) have figured most 
prominently in this work. These studies are difficult because bone mass (the phenotype) is a 
continuous, quantitative, polygenic trait, and is confounded by environmental factors such as 
nutrition, co-morbid disease, age, physical activity, and other factors. Also, this type of study 
design requires large numbers of subjects. In particular, the results of VDR studies to date 
have been confusing and contradictory (Garnero et al, J. Bone Miner. Res., 10:1283-1288 
(1995); Eisman et al, J. Bone. Miner. Res., 10:1289-1293 (1995); Peacock, J. Bone Miner. 
Res., 10:1294-1297 (1995)). Furthermore, the work thus far has not shed much light on the 
mechanism(s) whereby the genetic influences might exert their effect on bone mass. 

While it is well known that peak bone mass is largely determined by genetic rather 
than environmental factors, studies to determine the gene loci (and ultimately the genes) 
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linked to variation in bone mass are difficult and expensive. Study designs which utilize the 
power of linkage analysis, e.g., sib-pair or extended family, are generally more informative 
than simple association studies, although the latter do have value. However, genetic linkage 
studies involving bone mass are hampered by two major problems. The first problem is the 
5 phenotype, as discussed briefly above. Bone mass is a continuous, quantitative trait, and 

establishing a discrete phenotype is difficult. Each anatomical site for measurement may be 
influenced by several genes, many of which may be different from site to site. The second 
problem is the age component of the phenotype. By the time an individual can be identified 
as having low bone mass, there is a high probability that their parents or other members of 

10 prior generations will be deceased and therefore unavailable for study, and younger 

generations may not have even reached peak bone mass, making their phenotyping uncertain 
for genetic analysis. 

Regardless, linkage analysis can be used to find the location of a gene causing a 
hereditary "disorder" and does not require any knowledge of the biochemical nature of the 

15 disorder, i.e., a mutated protein that is believed to cause the disorder does not need to be 

known. Traditional approaches depend on assumptions concerning the disease process that 
might implicate a known protein as a candidate to be evaluated. The genetic localization 
approach using linkage analysis can be used to first find the general chromosomal region in 
which the defective gene is located and then to gradually reduce the size of the region in 

20 order to determine the location of the specific mutated gene as precisely as possible. After 
the gene itself is discovered within the candidate region, the messenger RNA and the protein 
are identified and, along with the DNA, are checked for mutations. 

The genetic localization approach has practical implications since the location of the 
disease can be used for prenatal diagnosis even before the altered gene that causes the disease 

25 is found. Linkage analysis can enable families, even many of those that do not have a sick 
child, to know whether they are carriers of a disease gene and to evaluate the condition of an 
unborn child through molecular diagnosis. The transmission of a disease within families, 
then, can be used to find the defective gene. As used herein, reference to "high bone mass" 
(HBM) is analogous to reference to a disease state, although from a practical standpoint high 

30 bone mass can actually help a subject avoid the disease known as osteoporosis. 

5 
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Linkage analysis is possible because of the nature of inheritance of chromosomes 
from parents to offspring. During meiosis, the two parental homologues pair to guide then- 
proper separation to daughter cells. While they are lined up and paired, the two homologues 
exchange pieces of the chromosomes, in an event called "crossing over" or "recombination." 
The resulting chromosomes are chimeric, that is, they contain parts that originate from both 
parental homologues. The closer together two sequences are on the chromosome, the less 
likely that a recombination event will occur between them, and the more closely linked they 
are. In a linkage analysis experiment, two positions on the chromosomes are followed from 
one generation to the next to determine the frequency of recombination between them. In a 
study of an inherited disease, one of the chromosomal positions is marked by the disease gent 
or its normal counterpart, i.e., the inheritance of the chromosomal region can be determined 
by examining whether the individual displays symptoms of the disorder or not. The other 
position is marked by a DNA sequence that shows natural variation in the population such 
that the two homologues can be distinguished based on the copy of the "marker" sequence 
that they possess. In every family, the inheritance of the genetic marker sequence is 
compared to the inheritance of the disease state. If, within a family carrying an autosomal 
dominant disorder such as high bone mass, every affected individual carries the same form of 
the marker and all the unaffected individuals carry at least one different form of the marker, 
there is a great probability that the disease gene and the marker are located close to each 
other. In this way, chromosomes may be systematically checked with known markers and 
compared to the disease state. The data obtained from the different families is combined, and 
analyzed together by a computer using statistical methods. The result is information 
indicating the probability of linkage between the genetic marker and the disease allowing 
different distances between them. A positive result can mean that the disease is very close to 
the marker, while a negative result indicates that it is far away on that chromosome, or on an 
entirely different chromosome. 

Linkage analysis is performed by typing all members of the affected family at a given 
marker locus and evaluating the co-inheritance of a particular disease state with the marker 
probe, thereby deteiroimng how often the two of them are co-inherited. The recombination 
frequency can be used as a measure of the genetic distance between two gene loci. A 

6 
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recombination frequency of 1% is equivalent to 1 map unit, or 1 centiMorgan (cM), which is 
roughly equivalent to 1,000 kb of DNA. This relationship holds up to frequencies of about 
20% or 20 cM. 

The entire human genome is 3,300 cM long. In order to find an unknown disease 
5 gene within 5-10 cM of a marker locus, the whole human genome can be searched with 

roughly 330 informative marker loci spaced at approximately 10 cM intervals (Botstein et al., 
Am. J. Hum. Genet., 32:314-331 (1980)). The reliability of linkage results is established by 
using a number of statistical methods. The method most commonly used for the analysis of 
linkage in humans is the LOD score method (Morton, Prog. Clin. Biol. Res., 147:245-265 

10 (1984), Morton et al. 9 Am. J. Hum. Genet., 38:868-883 (1986)) which was incorporated into 
the computer program, LIPED, by Ott, Am. J. Hum. Genet, 28:528-529 (1976). LOD scores 
are the logarithm of the ratio of the likelihood that two loci are linked at a given distance to 
that they are not linked (>50 cM apart). The advantage of using logarithmic values is that 
they can be summed among families with the same disease. This becomes necessary given 

15 the relatively small size of human families. 

By convention, a total LOD score greater than + 3.0 (that is, odds of linkage at the 
specified recombination frequency being 1000 times greater than odds of no linkage) is 
considered to be significant evidence for linkage at that particular recombination frequency. 
A total LOD score of less than - 2.0 (that is, odds of no linkage being 100 times greater than 

20 odds of linkage at the specified frequency) is considered to be strong evidence that the two 
loci under consideration are not linked at that particular recombination frequency. Until 
recently, most linkage analyses have been performed on the basis of two-point data, which is 
the relationship between the disorder under consideration and a particular genetic marker. 
However, as a result of the rapid advances in mapping the human genome over the last few 

25 years, and concomitant improvements in computer methodology, it has become feasible to 
carry out linkage analyses using multi-point data. Multi-point analysis provide a 
simultaneous analysis of linkage between the disease and several linked genetic markers, 
when the recombination distance among the markers is known. 

Multi-point analysis is advantageous for two reasons. First, the informativeness of the 

30 pedigree is usually increased. Each pedigree has a certain amount of potential information, 

7 
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dependent on the number of parents heterozygous for the marker loci and the number of 
affected individuals in the family. However, few markers are sufficiently polymorphic as to 
be informative in all those individuals. If multiple markers are considered simultaneously, 
then the probability of an individual being heterozygous for at least one of the markers is 
5 greatly increased. Second, an indication of the position of the disease gene among the 
markers may be determined. This allows identification of flanking markers, and thus 
eventually allows isolation of a small region in which the disease gene resides. Lathrop et 
al, Proc. Natl. Acad. Sci. USA, 81:3443-3446 (1984) have written the most widely used 
computer package, LINKAGE, for multi-point analysis. 
1 0 There is a need in the art for identifying the gene associated with a high bone mass 

phenotype. There is also a need for tools for the study of the high bone mass gene and 
phenotype. More generally there is need for the development of diagnostic tools and 
treatments. The present invention is directed to these, as well as other, important ends. 

15 SUMMARY OF THF. INYEKHQH 

It is an object of the invention to provide a nucleic acid comprising a mutation in 
LRP5 or LRP6 which results in a HBM phenotype when expressed in a cell, wherein said 
HBM phenotype results in bone mass modulation and/or lipid level modulation. Another 
embodiment contemplates that the mutation is located in propeller 1. In another embodiment, 

20 the nculeic acid encodes a mutation comprising at least one mutation of Tables 2 or 3 or 

which results in a mutation of one of the following G171 V, A214V, A65V, M282V, G171K, 
G171F, G171I, G171Q, L200V, T201 V, I202V, or S127V when expressed in a cell, and 
wherein expression of the nucleic acid in a subject results in bone mass modulation and/or 
lipid level modulation. In the instance of LRP6, the mutation is located in a position 

25 equivalent to LRP5 such that expression of the nucleic acid in a subject results in bone mass 
modulation and/or lipid level modulation. In another embodiment, the preferred mutation of 
LRP5 is G171V, A214V, A65V, M282V, G171K, G171F, G171I or G171Q or in an 
equivalent location if dealing with LRP6. 

Another embodiment contemplated herein is a polypeptide encoded by any of the 

30 above nucleic acids, wherein said polypeptide when expressed in a cell modulates Wnt 

8 
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signaling, LRP5 activity and/or LRP6 activity. The polypeptide can additionally or 

alternatively modulate bone mass and/or lipid levels when expressed in a subject. These 

polypeptides or biologically active fragments thereof may preferably contain any of the 

following mutations of Table 2, G171V, A214V, A65V, M282V, G171K, G171F, G171I, 
5 G171Q, L200V, T201V, I202V, or S127V in LRP5 or in a equivalent location in LRP6. The 

most preferred mutations are G171 V, A214V, A65V, M282V, G171K, G171F, G171I or 

G171Q in LRP5 or an equivalent location in LRP6. 

Yet another embodiment contemplates antibodies an immunogenic fragments thereof 

which bind to these proteins. The contemplated antibodies include a monoclonal antibody, a 
10 chimeric antibody, a bispecific antibody, a humanized antibody, a primatized® antibody, a 

human antibody, or a labeled antibody. Preferably, some of the antibodies can discriminate 

between the wild type and variant forms of LRP5 and LRP6. 

Another embodiment contemplates antibodies which bind to polypeptides comprising: 

208 KLYWADAKLSFIHRAN 223 , 277 ALYSPMDIQVLSQER 291 , 
1 5 6l GLEDAAAVDFQFSKGA 73 , 234 EGSLTHPFALTLSG 247 , 249 TLYWTDWQTRSIHACN 264 , 

,44 VLFWQDLDQPRAI 156 , 194 IYWPNGLTIDLEEQKLY 210 , 34 LLLFANRRDVRLVD 47 , 

75 GAVYWTDVSEEAIKQ 89 , 12l KLYWTDSETNRIEVA 135 of LRP5 or an equivalent domain 

on LRP6 or variants thereof. 

The above antibodies can also be used in a composition for modulating bone mass 
20 and/or lipid levels in a subject comprising a therapeutically effective amount of the antibody 

or immunogenic fragment and a pharmaceutically acceptable carrier. 

The invention further contemplates a method of diagnosing a HBM like phenotype in 

a subject comprising: (A) obtaining a biological sample from said subject; (B) exposing the 

sample to one of the described antibodies or immunogenic fragments; and (C) detecting 
25 whether the antibody bound a protein from the biological sample from said subject to 

determine whether the subject has a HBM-like phenotype. 

Another embodiment contemplates a transgenic animal having somatic and/or germ 

cells comprising a nucleic acid which comprises a promoter region that directs protein 

expression in animal and/or human cells operably linked to any of the herein described 
30 nucleic acids, and wherein said transgenic animal has at least three bone parameters 
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modulated by the expression of said nucleic acid. The promoter region can be selected from 
the group consisting of CMV, RSV, SV40, and EF-la, CMVpbActin, histone, type I 
collagen, TGF01, SX2, cfos/cjun, Cbfal, Fra/Jun, Dlx5, osteocalcin, osteopontin, bone 
sialoprotein, and collagenase promoter regions. 
5 A further embodiment of the invention contemplates an animal model for the study of 

bone density modulation and/or lipid level modulation comprising a first group of animals 
composed of any of the described transgenic animals and a second group of control animals. 

Another embodiment provides for a method of identifying agents which modulate the 
activity of an HBM-like nucleic acid comprising: (A) transfecting a cell with a vector of 
10 claim 11; (B) exposing the transfected cell of step (A) to a compound; and (C) deterrnining 

whether the compound modulates the activity of the HBM-like nucleic acid. Such agents can 
include a hormone, a growth factor, a peptide, RNA, siRNA, DNA, a mineral, a vitamin, a 
natural product, or a synthetic organic compound. 

Another aspect of the invention provides for a method for identifying compounds 
15 which modulate the interaction of Dkk with the Wnt signaling pathway comprising: (A) 

transfecting cells with constructs containing any of the described nucleic acids; (B) assessing 
changes in expression of a reporter element linked to a Wnt-responsive promoter; and (C) 
identifying as a Dkk/Wnt interaction modulating compound any compound which alters 
reporter gene expression compared with cells transfected with a Dkk construct alone. The 
20 cells are preferably cancer cells, liver cells or bone cells. The reporter element used is TCF- 
luciferase, tk-Renilla, or a combination thereof. 

Yet another embodiment includes a method of diagnosing a subject as expressing a 
nucleic acid comprising a nucleotide change of Tables 2 or 3 or any other mutations, the 
method comprising the steps of: (A) obtaining a biological sample from the subject; and (B) 
25 assaying for the presence of the nucleotide change which results in HBM phenotype. 

The invention also provides agents identified by the above methods which regulate 
Wnt activity, Dkk activity, bone mass and/or lipid levels. 

B R I EF DBS CR TPTTOW OF TTTR FIGURE S 
30 Fi 8- 1 shows ^ pedigree of the individuals used in the genetic linkage studies. 

10 
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Under each individual is an ID number, the z-score for spinal BMD, and the allele calls for 
the critical markers on chromosome 1 1 . Solid symbols represent "affected" individuals. 
Symbols containing "N" are "unaffected" individuals. DNA from 37 individuals was 
genotyped. Question marks denote unknown genotypes or individuals who were not 
5 genotyped. 

Fig. 2 depicts the BAC/STS content physical map of the HBM region in 1 lql3.3. 
STS markers derived from genes, ESTs, microsatellites, random sequences, and BAC end 
sequences are denoted above the long horizontal line. For markers that are present in GDB 
the same nomenclature has been used. Locus names (Dl 1S####) are listed in parentheses 

10 after the primary name if available. STSs derived from BAC end sequences are listed with 
the BAC name first followed by L or R for the left and right end of the clone, respectively. 
The two large arrows indicate the genetic markers that define the HBM critical region. The 
horizontal lines below the STSs indicate BAC clones identified by PCR-based screening of a 
nine-fold coverage BAC library. Open circles indicate that the marker did not amplify the 

15 corresponding BAC library address during library screening. Clone names use the following 
convention: B for BAC, the plate, row and column address, followed by -H indicating the 
HBM project (i.e., B36F16-H). 

Figs. 3A-3F show the genomic structure of Zmaxl (LRP5) with flanking intron 
sequences. Translation is initiated by the underlined "ATG" in exon 1. The site of the 

20 polymorphism in the HBM gene is in exon 3 and is represented by the underlined fT G," 

whereby this nucleotide is a "T" in the HBM gene. The 3' untranslated region of the mRNA 
is underlined within exon 23 (exon 1, SEQ ID NO:40; exon 2, SEQ ID NO:41; exon 3, SEQ 
ID NO:42; exon 4, SEQ ID NO:43; exon 5, SEQ ID NO:44; exon 6, SEQ ID NO:45; exon 7, 
SEQ ID NO:46; exon 8, SEQ ID NO:47; exon 9, SEQ ID NO:48; exon 10, SEQ ID NO:49; 

25 exon 11, SEQ ID NO:50; exon 12, SEQ ID NO:51; exon 13, SEQ ID NO:52; exon 14, SEQ 
ID NO:53; exon 15, SEQ ID NO:54; exon 16, SEQ ID NO:55; exon 17, SEQ ID NO:56; 
exon 18, SEQ ID NO:57; exon 19, SEQ ID NO:58; exon 20, SEQ ID NO:59; exon 21, SEQ 
ID NO:60; exon 22, SEQ ID NO:61; and exon 23; SEQ ID NO:62). 

Fig. 4 shows the domain organization of Zmaxl (LRP5), including the YWTD 

30 spacers, the extracellular attachment site, the binding site for LDL and calcium, the cysteine- 
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rich growth factor repeats, the transmembrane region, the ideal PEST region with the CK-II 
phosphorylation site and the internalization domain. Fig. 4 also shows the site of the glycine 
to valine change that occurs in the HBM protein. The signal peptide is located at amino acids 
1-31, the extracellular domain is located at amino acids 32-1385, the transmembrane segment 
is located at amino acids 1386-1413, and the cytoplasmic domain is located at amino acids 
1414-1615. 

Fig. 5 is a schematic illustration of the BAC contigs B527D12 and B200E21 in 
relation to the HBM gene. 

Figs. 6A-6J are the nucleotide (SEQ ID NO. l) and amino acid (SEQ ID NO:3) 
sequences of the wild-type gene, Zmaxl (LRP5). The location for the base pair substitution at 
nucleotide 582, a guanine to thymine, (SEQ ID NO: 2 and 4) is underlined. This allelic 
variant is the HBM gene. The HBM gene encodes for a protein with an amino acid 
substitution of glycine to valine at position 171. The 5' untranslated region (UTR) boundaries 
bases 1 to 70, and the 3' UTR boundaries bases 4916-5120. 

Figs. 7A and 7B are northern blot analyses showing the expression of Zmaxl (LRP5) 
in various tissues. 

Fig. 8 is a PCR product analysis. 

Fig. 9 is allele specific oligonucleotide detection of the Zmaxl (LRP5) exon 3 
mutation. 

Fig. 10 is the cellular localization of mouse Zmaxl (LRP5) by in situ hybridization at 
100X magnification using sense and antisense probes. 

Fig. 11 is the cellular localization of mouse Zmaxl (LRP5) by in situ hybridization at 
400X magnification using sense and antisense probes. 

Fig. 12 is the cellular localization of mouse Zmaxl (LRP5) by in situ hybridization of 
osteoblasts in the endosteum at 400X magnification using sense and antisense probes. 

Fig. 13 shows antisense inhibition of Zmaxl (LRP5) expression in MC-3T3 cells. 

Fig. 14 shows a Zmaxl (LRP5) exon 3 allele specific oligonucleotide (ASO) assay 
which illustrates the rarity of the HBM allele (right panels; T-specific ofigo; 58 °C Wash) as 
compared to the wild-type Zmaxl (LRP5) allele (left panels, G-specific oligo; 55 °C Wash). 
The positive spots appearing in the right panels were positive controls. 
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Fig. 15 depicts a model representing the potential role of Zmaxl (LRP5) in focal 
adhesion signaling. 

Fig. 16 depicts a schematic of two Zmaxl (LRP5) gene targeting vectors for the 
knock-out of endogenous mouse Zmax or conditional knock-in of the HBM polymorphism. 
5 B, X, and R indicate BamHl, Xbal, and EcoKL sites in DNA BAC 473 5P5 respectively. 

Exons 3, 4, and 5 are indicated by black rectangles. A G -> T base change is engineered at 
base 24 of exon 3 to produce the HBM polymorphism. The location of a LoxP flanked 
cassette containing a neomycin resistance gene and a synthetic pause sequence and probes 
used for screening and characterizing of ES cell clones are also indicated. 
10 Fig. 17 confirms expression by the transgenic (Le. 9 HBMMCBA and HBMMTIC) and 

wild-type (*.<?., ZmaxWTCBA and ZmaxWTTIC) plasmid constructs. These constructs were 
transiently transfected into HOB-02-02 cells and the mRNA levels determined using 
TaqMan® quantitative PCR. HBMMCBA and ZmaxWTCBA are shown in the left column 
(i.e. CMVpActin) and HBMMTIC and ZmaxWTTIC are shown in the right column (i.e. 
1 5 Type I collagen) of the Table. 

Fig. 18 depicts a comparison between the human and mouse TaqMan® Primer/Probe 
sets. HOB (HOB-03-C5) and mouse (MC-3T3-E1) osteoblastic cell mRNA was analyzed 
using the probes and primers. 

Fig. 19 depicts the quantification of human Zmaxl (LRP5) mRNA expressed in a 
20 mixed human and mouse RNA background using the TaqMan® Primer/Probe sets. Results 
are presented in Human Zmaxl (LRP5) mRNA added (ng) versus Human Zmaxl (LRP5) 
mRNA measured (ng). 

Fig. 20 depicts expression of HBM in transgenic mice based on mRNA expression 
analyzed by TaqMan®. 

25 Fig. 21A-F Panels A-C depict the analysis of various transgenic mouse lines which 

express the HBMMCBA construct in spine (Fig. 21 A), femur (Fig. 21B) and total body (Fig. 
21C). Panels D-F depict the analysis of various transgenic mouse lines which express the 
HBMMTIC construct in spine (Fig. 21D), femur (Fig. 21E) and total body (Fig. 21F). 

Fig. 22 depicts changes in BMD in HBM transgenic mice (i.e. 9 HBMMCBA and 

30 HBMMTIC constructs) at 5 weeks using in vivo pDXA* analysis. The BMD changes are 
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presented as compared to wild-type animals which were also only 5 weeks old. 

Fig. 23 depicts changes in BMD in HBM transgenic mice (i.e., HBMMCBA and 
HBMMTIC constructs) at 9 weeks using in vivo pDXA* analysis. The BMD changes are 
presented as compared to wild-type animals which were also only 9 weeks old. 
5 Fig. 24 (A-D) presents the sequence of the insert of the gene, HBMGI_2AS, 

subcloned into the vector (SEQ ID NO:759). 

Fig. 25 (A-D) presents the sequence of the insert of the gene, ZMAXGI_3AS, 
subcloned into the vector (SEQ ID NO:760). 

Fig. 26 (A-C) presents an alignment of human (SEQ ID NO:761) and mouse (SEQ 
10 ID NO:762) Zmaxl (LRP5) amino acid sequences. 

Fig. 27 (A-C) presents an alignment of human LRP5 (Zmaxl) (SEQ ID NO:763) and 
LRP6 (SEQ ID NO:764) amino acid sequences. 

Fig. 28 shows a schematic of the components of the Wnt signal transduction pathway. 
Schematic obtained from: ht1p://www.stanford.edu/~musse/pathways/cell2.html. 
15 Fi 8- 29 shows that the mutation G171F in LRP5 produces a greater activation of the 

Wnt pathway than LRP5 which is consistent with HBM activity. 

Fig. 30 shows that the mutation M282V in LRP5 produces an activation of the Wnt 
pathway which is consistent with HBM activity in U20S cells. 

Fig. 31 shows a table of proteins identified in a Y2H screen using a Dkk-1 bait 
20 sequence. These proteins are identified by both their nucleic acid and amino acid accession 
numbers. 

Fig. 32 shows the differential binding of an antibody generated to a sequence (a.a. 
165-177) containing the HBM mutation in LRP5 in LRP5 and HBM virus-infected cells. 
Fig. 33 shows a diagram of the Xenopus Embryo Assay for Wnt activity. 
25 Fi S- 34 shows the effects of Zmax/LRP5 and HBM on Wnt signaling in the Xenopus 

embryo assay. 

Fig, 35 shows the effects of Zmax/LRP5 and HBM on induction of secondary axis 
formation in the Xenopus embryo assay. 

Fig. 36 shows the effects of human Dkk-1 on the repression of the canonical Wnt 
30 pathway. 
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Fig, 37 shows the effects of human Dkk-1 on Zmax/LRP5 and HBM-mediated Wnt 
signaling. 

Fig. 38 shows a table of peptide aptamer insert sequences identified in a Y2H screen 
using a LRP5 ligand binding domain bait sequence. 
5 Fig. 39 shows pcDNA3.1 construct names with nucleotide sequences for LRP5- 

binding peptide aptamers, Dkk-1 peptides and control constructs. 

Fig. 40 shows the results of a minimum interaction domain mapping screen of Dkk-1 
with LRP5. At the top, a map of Dkk-1 showing the location of the signal sequence, and 
cysteine rich domains 1 and 2. Below, the extent of domains examined using LRP5 LBD 
10 baits, LBD1 and LBD4. To the right, scoring of the binding results observed in the 
experiment. 

Fig. 41 shows the effects of Dkk-1 and Dkk-2 on Wntl signaling with coreceptors 
LRP5, HBM, and LRP6 in HOB03CE6 cells. 

Fig. 42 shows the effects of Dkk-1 and Dkk-2 on Wnt3a signaling with coreceptors 
15 LRP5, HBM, and LRP6 in HOB03CE6 cells. 

Fig. 43 demonstrates that the LRP5-LBD peptide aptamer 262 activates Wnt signaling 
in the presence of Wnt3a in U20S cells. 

Fig. 44 shows the amino acid sequences for the corresponding LRP5-binding 
peptides, Dkk-1 peptide aptamers and control constructs in Figure 39. 
20 Fig. 45 shows that Dkk-1 represses Wnt3a-mediated Wnt signaling in U20S bone 

cells using the cell-based reporter gene assay for high throughput screening. 

Fig. 46 demonstrates that Wntl -HBM generated signaling is not efficiently inhibited 
by Dkk-1 in U20S bone cells while LRP5 and LRP6-mediated signaling are using the cell- 
based reporter gene assay for high throughput screening. 
25 Fig. 47 shows that the TCF signal in the cell-based reporter gene assay for high 

throughput screening can be modulated by Dkk-1 and Dkk-1 -AP without Wnt DNA 
transfection. 

Fig. 48 shows the morphological results in the Xenopus assay using aptamers 261 and 
262 from the LRP5-LBD to activate Wnt signaling. 
30 Fig. 49 demonstrates that LRP5-LBD aptamers 261 and 262 induce Wnt signaling 
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over other LRP5 aptamers. 

Fig. 50 depicts the LRP5 domain structure. The symbols are defined at the upper 
right. The structural domains are numbered successively from top to bottom corresponding 
to the N-terminal to C-terminal ends of the protein as Propeller 1, EGF-like Domain 1, 
Propeller 2, EGF-like Domain 2, and so forth. The structure was determined using the motifs 
and nomenclature described in Springer et aL, 1998 J. Mol Biol 283: 837-62. 

Fig. 51 CLUSTALW (1.8) multiple sequence alignment of the (3-propeller segments 
of LRP5 from human (af077820h) and mouse (af064984m and afD77847m), LRP6 from 
human (af074264h) and mouse (af074265m), and the sequences from proteins with modeled 
6-bladed p-propellers: chicken LRPl(llpx) and human nidogen (lndx). For the LPRs, the 
final suffix letter designates which of the four propeller domains the sequence comes from 
(a = > prop. #1, b = > prop. #2, etc.). The final four lines give the secondary structure 
assignments predicted by DSC for all sequences given in this alignment (H = helix, E = 
strand and C = coil) and the weights assigned to each structural type at each position on a 
scale of 0 (least probable) to 9 (most probable). Sequences corresponding to Springer's 
models' strand positions are underlined and color-coded according to which of the six 
blades they belong to. In the propeller geometry, alternate loops fall on opposite faces of 
the disc-shaped domain. The loops on the "top" face (i.e. , opposite the points of entry and 
exit of the chain from the structure) are colored red. The position of the G171V mutation 
is marked with "*". 

Fig. 52 displays the functional effect of mutations on side chain interactions in HBM 
protein as compared the wild-type Zmaxl (LRP5) protein. It also shows the side chain 
interactions of G171F, an HBM like variant. 

Fig. 53 displays the structural change due to the OPPG mutation versus the wild-type, 
Zmaxl (LRP5). Panel A depicts the homology model of LRP5's second propeller domain; 
Panel B depicts the third propeller domain and Panel C depicts the second propeller domain 
with two mutations. 
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DETAILED DESCRIPTION OF THE TNVENTTON 

1. Definitions 

To aid in the understanding of the specification and claims, the following definitions 
are provided. 

5 "Gene" refers to a DNA sequence that encodes through its template or messenger 

RNA a sequence of amino acids characteristic of a specific peptide. The term "gene" 
includes intervening, non-coding regions, as well as regulatory regions, and can include 5 f 
and 3* ends. 

By "nucleic acid" is meant to include single stranded and double stranded nucleic 

10 acids including, but not limited to DNAs, RNAs (e.g., mRNA, tRNAs, siRNAs), cDNAs, 

recombinant DNA (rDNA), rRNAs, antisense nucleic acids, oligonucleotides, and oligomers, 
and polynucleotides. The term may also include hybrids such as triple stranded regions of 
RNA and/or DNA or double stranded RNA:DNA hybrids. The term also is contemplated to 
include modified nucleic acids such as, but not limited to biotinylated nucleic acids, tritylated 

15 nucleic acids, fluorophor labeled nucleic acids, inosine, and the like. 

"Gene sequence" refers to a nucleic acid molecule, including DNA which contains a 
non-transcribed or non-translated sequence, which comprises a gene. The term is also 
intended to include any combination of gene(s), gene fragment(s), non-transcribed 
sequence(s) or non-translated sequence(s) which are present on the same DNA molecule. 

20 The nucleic acid sequences of the present invention may be derived from a variety of 

sources including DNA, cDNA, synthetic DNA, synthetic RNA or combinations thereof. 
Such sequences may comprise genomic DNA which may or may not include naturally 
occurring introns. Moreover, such genomic DNA may be obtained in association with 
promoter regions and/or poly (A) sequences. The sequences, genomic DNA or cDNA may 

25 be obtained in any of several ways. Genomic DNA can be extracted and purified from 

suitable cells by means well known in the art. Alternatively, mRNA can be isolated from a 
cell and used to produce cDNA by reverse transcription or other means. 

"cDNA" refers to complementary or copy DNA produced from an RNA template by 
the action of RNA-dependent DNA polymerase (reverse transcriptase). Thus, a "cDNA 

30 clone" means a duplex DNA sequence for which one strand is complementary to an RNA 
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molecule of interest, carried in a cloning vector or PCR amplified. cDNA can also be single 
stranded after first strand synthesis by reverse transcriptase. In this form, it is a useful PCR 
template and does not need to be carried in a cloning vector. This term includes genes from 
which the intervening sequences have been removed. Thus, the term "gene", as sometimes 
5 used generically, can also include nucleic acid molecules comprising cDNA and cDNA 
clones. 

"Recombinant DNA" means a molecule that has been engineered by splicing in vitro a 
cDNA or genomic DNA sequence or altering a sequence by methods such as PCR 
mutagenesis. 

1 0 "Cloning" refers to the use of in vitro recombination techniques to insert a particular 

gene or other DNA sequence into a vector molecule. In order to successfully clone a desired 
gene, it is necessary to use methods for generating DNA fragments, for joining the fragments 
to vector molecules, for introducing the composite DNA molecule into a host cell in which it 
can replicate, and for selecting the clone having the target gene from amongst the recipient 

15 host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules containing 
cDNA inserts which together comprise the entire or a partial repertoire of genes expressed in 
a particular tissue or cell source. Such a cDNA library can be prepared by methods known to 
one skilled in the art and described by, for example, Cowell and Austin, "cDNA Library 

20 Protocols," Methods in Molecular Biology (1 997). 

"Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence which is 
able to replicate in a host cell. This term can also include artificial chromosomes such as 
BACs and YACs. The cloning vehicle is characterized by one or more endonuclease 
recognition sites at which such DNA sequences may be cut in a determinable fashion without 

25 loss of an essential biological function of the DNA, which may contain a marker suitable for 
use in the identification of transformed cells. 

"Expression" refers to the process comprising transcription of a gene sequence and 
subsequent processing steps, such as translation of a resultant mRNA to produce the final end 
product of a gene. The end product may be a protein (such as an enzyme or receptor) or a 

30 nucleic acid (such as a fRNA, antisense RNA, or other regulatory factor). The term 

18 



BNSDOCID: <WO 02092000A2_L> 



WO 02/092000 



PCT/US02/14877 



"expression control sequence" refers to a sequence of nucleotides that control or regulate 
expression of structural genes when operably linked to those genes. These include, for 
example, the lac systems, the trp system, major operator and promoter regions of the phage 
lambda, the control region of fd coat protein and other sequences known to control the 
5 expression of genes in prokaiyotic or eukaryotic cells. Expression control sequences will 
vary depending on whether the vector is designed to express the operably linked gene in a 
prokaryotic or eukaryotic host, and may contain transcriptional elements such as enhancer 
elements, termination sequences, tissue-specificity elements and/or translational initiation and 
termination sites. 

10 "Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle but 

which is capable of expressing a gene which has been cloned into it, after transformation into 
a host. The cloned gene is usually placed under the control of (i.e., operably linked to) an 
expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 

15 repressor, thereby controlling the transcription of adjacent gene(s). 

"Promoter" refers to a DNA sequence that can be recognized by an RNA polymerase. - 
The presence of such a sequence permits the RNA polymerase to bind and initiate - 
transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene sequences 

20 which may be necessary for the initiation of transcription. The presence of a promoter region 
is sufficient to cause the expression of an operably linked gene sequence. The term 
"promoter" is sometimes used in the art to generically indicate a promoter region. Many 
different promoters are known in the art which direct expression of a gene in a certain cell 
types. Tissue-specific promoters can comprise nucleic acid sequences which cause a greater 

25 (or decreased) level of expression in cells of a certain tissue type. 

"Operably linked" means that the promoter controls the initiation of expression of the 
gene. A promoter is operably linked to a sequence of proximal DNA if upon introduction 
into a host cell the promoter determines the transcription of the proximal DNA sequence(s) 
into one or more species of RNA. A promoter is operably linked to a DNA sequence if the 

30 promoter is capable of initiating transcription of that DNA sequence. 
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"Prokaryote" refers to all organisms without a true nucleus, including bacteria. 
"Eukaryote" refers to organisms and cells that have a true nucleus, including 
mammalian cells. 

"Host" includes prokaryotes and eukaryotes, such as yeast and filamentous fungi, as 
well as plant and animal cells. The term includes an organism or cell that is the recipient of a 
replicable expression vehicle. 

The term "animal" is used herein to include all vertebrate animals, except humans. It 
also includes an individual animal in all stages of development, including embryonic and 
fetal stages. Preferred animals include higher eukaryotes such as avians, rodents (e.g., mice, 
rabbits, rats, chinchillas, guinea pigs, hamsters and the like), and mammals. Preferred 
mammals include bovine, equine, feline, canine, ovine, caprine, porcine, buffalo, humans, 
and primates. 

A "transgenic animal" is an animal containing one or more cells bearing genetic 
information received, directly or indirectly, by deliberate genetic manipulation or by 
inheritance from a manipulated progenitor at a subcellular level, such as by microinjection or 
infection with a recombinant viral vector (e.g., adenovirus, retrovirus, herpes virus, adeno- 
associated virus, lentivirus). This introduced DNA molecule may be integrated within a 
chromosome, or it may be extra-chromosomally replicating DNA. 

"Embryonic stem cells" or "ES cells" as used herein are cells or cell lines usually 
derived from embryos which are pluripotent meaning that they are un-differentiated cells. 
These cells are also capable of incorporating exogenous DNA by homologous recombination 
and subsequently developing into any tissue in the body when incorporated into a host 
embryo. It is possible to isolate pluripotent cells from sources other than embryonic tissue by 
methods which are well understood in the art. 

Embryonic stem cells in mice have enabled researchers to select for transgenic cells 
and perform gene targeting. This allows more genetic engineering than is possible with other 
transgenic techniques. For example, mouse ES cells are relatively easy to grow as colonies in 
vitro. The cells can 'be transfected by standard procedures and transgenic cells clonally 
selected by antibiotic resistance. See, for example, Doetschman et aL., 1994, Gene transfer 
in embryonic stem cells. In Pinkert (Ed.) Transgenic Animal Technology: A Laboratory 
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Handbook . Academic Press Inc., New York, pp.1 15-146. Furthermore, the efficiency of this 
process is such that sufficient transgenic colonies (hundreds to thousands) can be produced to 
allow a second selection for homologous recombinants. Mouse ES cells can then be 
combined with a normal host embryo and, because they retain their potency, can develop into 
5 all the tissues in the resulting chimeric animal, including the germ cells. The transgenic 
modification can then be transmitted to subsequent generations. 

Methods for deriving embryonic stem (ES) cell hues in vitro from early 
preimplantation mouse embryos are well known. See for example, Evans et al, 1981 Nature 
29: 154-6 and Martin, 1981, Proc. Nat Acad. Set USA, 78: 7634-8. ES cells can be passaged 

10 in an undifferentiated state, provided that a feeder layer of fibroblast cells or a differentiation 
inhibiting source is present. 

The term "somatic cell" indicates any animal or human cell which is not a sperm or 
egg cell or is capable of becoming a sperm or egg cell. The term "germ cell" or "germ-line 
cell" refers to any cell which is either a sperm or egg cell or is capable of developing into a 

15 sperm or egg cell and can therefore pass its genetic information to offspring. The term "germ 
cell-line transgenic animal" refers to a transgenic animal in which the genetic information 
was incorporated in a germ line cell, thereby conferring the ability to transfer the information 
to offspring. If such offspring in fact possess some or all of that information, then they, too, 
are transgenic animals. 

20 The genetic alteration of genetic information may be foreign to the species of animal 

to which the recipient belongs, or foreign only to the particular individual recipient. In the 
last case, the altered or introduced gene may be expressed differently than the native gene. 

"Fragment" of a gene refers to any portion of a gene sequence. A "biologically active 
fragment" refers to any portion of the gene that retains at least one biological activity of that 

25 gene. For example, the fragment can perhaps hybridize to its cognate sequence or is capable 
of being translated into a polypeptide fragment encoded by the gene from which it is derived. 

"Variant" refers to a gene that is substantially similar in structure and biological 
activity or immunological characteristics to either the entire gene or to a fragment of the 
gene. Provided that the two genes possess a similar activity, they are considered variant as 

30 that term is used herein even if the sequence of encoded amino acid residues is not identical. 
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Preferentially, as used herein (unless otherwise defined) the variant is one of LRP5, HBM or 
LRP6. The variant preferably is one that yields an HBM-like phenotype (i.e., enhances bones 
mass and/or modulates lipid levels). These variants include missense mutations, single 
nucleotide polymorphisms (SNPs), mutations which result in changes in the amino acid 
sequence of the protein encoded by the gene or nucleic acid, and combinations thereof, as 
well as com in the exon domains of the HBM gene and mutations in LRP5 or LRP6 which 
result in an HBM like phenotype. 

"Amplification of nucleic acids" refers to methods such as polymerase chain reaction 
(PCR), ligation amplification (or ligase chain reaction, LCR) and amplification methods 
based on the use of Q-beta replicase. These methods are well known in the art and described, 
for example, in U.S. Patent Nos. 4,683,195 and 4,683,202. Reagents and hardware for 
conducting PCR are commercially available. Primers useful for amplifying sequences from 
the HBM region are preferably complementary to, and hybridize specifically to sequences in 
the HBM region or in regions that flank a target region therein. HBM sequences generated 
by amplification may be sequenced directly. Alternatively, the amplified sequence(s) may be 
cloned prior to sequence analysis. 

"Antibodies" may refer to polyclonal and/or monoclonal antibodies and fragments 
thereof, and immunologic binding equivalents thereof, that can bind to the HBM proteins and 
fragments thereof or to nucleic acid sequences from the HBM region, particularly from the 
HBM locus or a portion thereof. Preferred antibodies also include those capable of binding to 
LRP5, LRP6 and HBM variants. The term antibody is used both to refer to a homogeneous 
molecular entity, or a mixture such as a serum product made up of a plurality of different 
molecular entities. Proteins may be prepared synthetically in a protein synthesizer and 
coupled to a carrier molecule and injected over several months into rabbits. Rabbit sera is 
tested for immunoreactivity to the HBM protein or fragment. Monoclonal antibodies may be 
made by injecting mice with the proteins, or fragments thereof. Monoclonal antibodies will 
be screened by ELISA and tested for specific immunoreactivity with HBM protein or 
fragments thereof. Harlow et al, Antibodies: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1988) and Using Antibodies: A Laboratory Manual, 
Harlow, Ed and Lane, David (Cold Spring Harbor Press, 1999). These antibodies will be 
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useful in assays as well as pharmaceuticals. By "antibody" is meant to include but not 
limited to polyclonal, monoclonal, chimeric, human, humanized, bispecific, multispecific, 
primatized™ antibodies. 

"HBM protein" refers to a protein that is identical to a Zmaxl (LRP5) protein except 
5 that it contains an alteration of glycine 171 to a valine. An HBM protein is defined for any 
organism that encodes a Zmaxl (LRP5) true homolog. For example, a mouse HBM protein 
refers to the mouse Zmaxl (LRP5) protein having the glycine 170 to valine substitution. 

By "HBM-like" is meant a variant of LRP5, LRP6 or HBM which when expressed in 
a cell is capable of modulating bone mass, lipid levels, Dkk activity, and/or Wnt activity. 

10 In one embodiment of the present invention, "HBM gene" refers to the genomic DNA 

sequence found in individuals showing the HBM characteristic or phenotype, where the 
sequence encodes the protein indicated by SEQ ID NO: 4. The HBM gene and the Zmaxl 
(LRP5) gene are allelic. The protein encoded by the HBM gene has the property of causing 
elevated bone mass, while the protein encoded by the Zmaxl (LRP5) gene does not. The 

1 5 HBM gene and the Zmaxl (LRP5) gene differ in that the HBM gene has a thymine at position 
582, while the Zmaxl gene has a guanine at position 582. The HBM gene comprises the 
nucleic acid sequence shown as SEQ ID NO: 2. The HBM gene may also be referred to as an 
"HBM polymorphism." Other HBM genes may further have silent mutations, such as those 
discussed in Section 3 below. 

20 In alternative embodiments of the present invention, "HBM gene" may also refer to 

any allelic variant of Zmaxl (LRP5) or LRP6 which results in the HBM phenotype. Such 
variants may include alteration from the wild-type protein coding sequence as described 
herein and/or alteration in expression control sequences of Zmaxl (LRP5) or contains an 
amino acid mutation in LRP5 or LRP6, such that the resulting protein produces a phenotype 

25 which enhances bone mass and/or modulates lipid levels. A preferred example of such a 
variant is an alteration of the endogenous Zmaxl (LRP5) promoter region resulting in 
increased expression of the Zmaxl (LRP5) protein. 

"Normal," "wild-type," "unaffected", "Zmaxl", "Zmax", "LR3" and "LRP5" all refer 
to the genomic DNA sequence that encodes the protein indicated by SEQ ID NO: 3. LRP5 

30 has also been referred to LRP7 in mouse. Zmaxl , LRP5 and Zmax may be used 
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interchangeably throughout the specification and are meant to be the same gene, perhaps only 
relating to the gene in a different organism. The Zmaxl gene has a guanine at position 582 in 
the human sequence. The Zmaxl gene of human comprises the nucleic acid sequence shown 
as SEQ ID NO: 1. "Normal," "wild-type," "unaffected", "Zmaxl" and "LRP5" also refer to 
5 allelic variants of the genomic sequence that encodes proteins that do not contribute to 

elevated bone mass. The Zmaxl (LRP5) gene is common in the human population, while the 
HBM gene is rare. 

"5YWTD+EGF" refers to a repeat unit found in the Zmaxl (LRP5) protein, consisting 
of five YWTD repeats followed by an EGF repeat. 
10 "Bone development" generally refers to any process involved in the change of bone 

over time, including, for example, normal development, changes that occur during disease 
states, and changes that occur during aging. This may refer to structural changes in and 
dynamic rate changes such as growth rates, resorption rates, bone repair rates, and etc. "Bone 
development disorder" particularly refers to any disorders in bone development including, for 
15 example, changes that occur during disease states and changes that occur during aging. Bone 
development may be progressive or cyclical in nature. Aspects of bone that may change 
during development include, for example, mineralization, formation of specific anatomical 
features, and relative or absolute numbers of various cell types. 

"Bone modulation" or "modulation of bone formation" refers to the ability to affect 
20 any of the physiological processes involved in bone remodeling, as will be appreciated by one 
skilled in the art, including, for example, bone resorption and appositional bone growth, by, 
inter alia, osteoclastic and osteoblastic activity, and may comprise some or all of bone 
formation and development as used herein. 

Bone is a dynamic tissue that is continually adapting and renewing itself through the 
25 removal of old or unnecessary bone by osteoclasts and the rebuilding of new bone by 

osteoblasts. The nature of the coupling between these processes is responsible both for the 
modeling of bone during growth as well as the maintenance of adult skeletal integrity through 
remodeling and repair to meet the everyday needs of mechanical usage. There are a number 
of diseases of bone that result from an uncoupling of the balance between bone resorption and 
30 formation. With aging there is a gradual "physiologic" imbalance in bone turnover, which is 
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particvilarly exacerbated in women due to menopausal loss of estrogen support, that leads to a 
progressive loss of bone. The reduction in bone mass and deterioration in bone architecture 
results in an increase in bone fragility and susceptibility to spontaneous fractures. For every 
10 percent of bone that is lost the risk of fracture doubles. Individuals with bone mineral 
5 density (BMD) in the spine or proximal femur 2.5 or more standard deviations below normal 
peak bone mass are classified as osteoporotic. However, osteopenic individuals with BMD 
between 1 and 2.5 standard deviations below the norm are clearly at risk of suffering bone 
loss related disorders. 

Bone modulation may be assessed by measuring parameters such as bone mineral 

10 density (BMD) and bone mineral content (BMC) by pDXA X-ray methods, bone size, 

thickness or volume as measured by X-ray, bone formation rates as measured for example by 
calcien labeling, total, trabecular, and mid-shaft density as measured by pQCT and/or mCT 
methods, connectivity and other histological parameters as measured by mCT methods, 
mechanical bending and compressive strengths as preferably measured in femur and 

15 vertebrae respectively. Due to the nature of these measurements, each may be more or less 

appropriate for a given situation as the skilled practitioner will appreciate. Furthermore, i 
parameters and methodologies such as a clinical history of freedom from fracture, bone 
shape, bone morphology, connectivity, normal histology, fracture repair rates, and other bone 
quality parameters are known and used in the art. Most preferably, bone quality may be 

20 assessed by the compressive strength of vertebra when such a measurement is appropriate. 
Bone modulation may also be assessed by rates of change in the various parameters. Most 
preferably, bone modulation is assessed at more than one age. 

"Normal bone density" refers to a bone density within two standard deviations of a Z 
score of 0 in the context of the HBM linkage study. In a general context, the range of normal 

25 bone density parameters is determined by routine statistical methods. A normal parameter is 
within about 1 or 2 standard deviations of the age and sex normalized parameter, preferably 
about 2 standard deviations. A statistical measure of meaningfulness is the P value which can 
represent the likelihood that the associated measurement is significantly different from the 
mean. Significant P values are P < 0.05, 0.01, 0.005, and 0.001, preferably at least P < 0.01. 

30 "HBM" refers to "high bone mass" although this term may also be expressed in terms 
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of bone density, mineral content, and size. 

The "HBM phenotype" and "HBM-like phenotype" may be characterized by an 
increase of about 2 or more standard deviations, preferably 2, 2.5, 3, or more standard 
deviations in 1, 2, 3, 4, 5, or more quantitative parameters of bone modulation, preferably 
bone density and mineral content and bone strength parameters, above the age and sex norm 
for that parameter. The HBM phenotype and HBM-like phenotype are characterized by 
statistically significant increases in at least one parameter, preferably at least 2 parameters, 
and more preferably at least 3 or more parameters. The HBM phenotype and the HBM-like 
phenotype may also be characterized by an increase in one or more bone quality parameters 
and most preferably increasing parameters are not accompanied by a decrease in any bone 
quality parameters. Most preferably, an increase in bone modulation parameters and/or bone 
quality measurements is observed at more than one age. The HBM phenotype and HBM-like 
phenotype also includes changes of lipid levels, Wht activity and/or Dkk activity. 

A "Zmaxl system" or "LRP5 system" refers to a purified protein, cell extract, cell, 
animal, human or any other composition of matter in which Zmaxl (LRP5) is present in a 
normal or mutant form. 

The terms "isolated" and "purified" refer to a substance altered by hand of man from 
the natural environment. An isolated peptide may be for example in a substantially pure form 
or otherwise displaced from its native environment such as by expression in an isolated cell 
line or transgenic animal. An isolated sequence may for example be a molecule in 
substantially pure form or displaced from its native environment such that at least one end of 
said isolated sequence is not contiguous with the sequence it would be contiguous with in 
nature. 

A "surrogate marker" refers to a diagnostic indication, symptom, sign or other feature 
that can be observed in a cell, tissue, human or animal that is correlated with the HBM gene 
or elevated bone mass or both, but that is easier to measure than bone density. The general 
concept of a surrogate marker is well accepted in diagnostic medicine. 

The present invention encompasses the Zmaxl (LRP5) gene and Zmaxl (LRP5) 
protein in the forms indicated by SEQ ID NOS: 1 and 3, respectively, and other closely 
related variants, as well as the adjacent chromosomal regions of Zmaxl (LRP5) necessary for 
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its accurate expression. In a preferred embodiment, the present invention is directed to at 
least 15 contiguous nucleotides of the nucleic acid sequence of SEQ ID NO: 1. 

The present invention further encompasses variants of the LRP6 gene and its 
corresponding protein which result in an enhanced bone mass and/or modulate lipids and/or 
5 modulate the Wnt signaling pathway. 

"Biologically active" refers to those forms of proteins and polypeptides, including 
conservatively substituted variants, alleles of genes encoding a protein or polypeptide 
fragments of proteins which retain a biological and/or immunological activity of the wild- 
type protein or polypeptide. Preferably the activity is one which induces a change in LRP5, 
10 LRP6, or Dkk activity, such as inhibiting the interaction of LRP5 or LRP6 or variants thereof 
with Dkk, or Dkk with another ligand binding partner {e.g., Dkk-1 with a Dkk-1 interacting 
protein such as those shown in Fig. 31). By biologically active is also meant to include any 
form which modulates Wnt signaling. 

By "modulate"and "regulate" is meant methods, conditions, or agents which increase 
15 or decrease the wild-type activity of an enzyme, inhibitor, signal transducer, receptor, 

transcription activator, co-factor, and the like. This change in activity can be an increase or 
decrease of mRNA translation, mRNA or DNA transcription, and/or mRNA or protein 
degradation, which may in turn correspond to an increase or decrease in biological activity. 
By " modulated activity" and "regulated activity" is meant any activity, condition, 
20 disease or phenotype which is modulated by a biologically active form of a protein. 

Modulation may be effected by affecting the concentration or subcellular localization of 
biologically active protein, i.e., by regulating expression or degradation, or by direct agonistic 
or antagonistic effect as, for example, through inhibition, activation, binding, or release of 
substrate, modification either chemically or structurally, or by direct or indirect interaction 
25 which may involve additional factors. 

By "effective amount" or "dose effective amount" or "therapeutically effective 
amount" is meant an amount of an agent which modulates a biological activity of the 
polypeptide of the invention. 

By "immunologically active" is meant any immunoglobulin protein or fragment 
30 thereof which recognizes and binds to an antigen. 
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By "Dkk" is meant to refer to the nucleic acids and proteins of members of the Dkk 
(Dickkopf) family. This includes, but is not limited to, Dkk-1, Dkk-2, Dkk-3, Dkk-4, Soggy, 
and related Dkk proteins. Dkk-1 is a preferred embodiment of the present invention. 
However, the Dkk proteins have substantial homology and one skilled in the art will 
appreciate that all of the embodiments of the present invention utilizing Dkk-1 may also be 
utilized with the other Dkk proteins. 

By "Dkk-1" is meant to refer to the Dkk-1 protein and nucleic acids which encode the 
Dkk-1 protein. Dkk-1 refers to Dickkopf-1, and mXenopus it is related to at least Dkk-2, 
Dkk-3, and Dkk-4 (see Krupnik et al, 1999 Gene 238: 301-313). Dkk-1 was first identified 
mXenopus (Glinka et al, 1998 Nature 391: 357-62). It was recognized as a factor capable of 
inducing ectopic head formation in the presence of inhibition of the BMP pathway. It was 
then also found to inhibit the axis-inducing activity of several Xenopus Wnt molecules by 
acting as an extracellular antagonist of Wnt signaling. Mammalian homologs have been 
found including Dkk-1, Dkk-2, Dkk-3, Dkk-4 and soggy (Fedi et al, 1999 J. Biol. Chern. 
274: 19465-72; and Krupnick et al. 1999). Human Dkk-1 was also referred to as sk (Fedi et 
al. 1999). As used herein, Dkk-1 is meant to include proteins from any species having a Wnt 
pathway in which Dkk-1 interacts. Particularly preferred are mammalian species (e.g., 
murine, caprine, canine, bovine, feline, equine, primate, ovine, porcine and the like), with 
particularly preferred mammals being humans. Nucleic acid sequences encoding Dkk-1 
include, but are not limited to human Dkk-1 (GenBank Accession Nos. AH009834, 
XM_005730, AF261 158, AF261 157, AF177394, AF127563 and NM_012242), Mus 
musculus dickkopf homolog 1 (GenBank Accession No. NM_010051), an&Danio rerio 
dickkopf-1 (GenBank Accession Nos. AF1 16852 and AB023488). The genomic sequences 
with exon annotation are GenBank Accession Nos. AF261 157 and AF261 158. Also 
contemplated are homologs of these sequences which have Dkk-1 activity in the Wnt 
pathway. Dkk-1 amino acid sequences include, but are not limited to human dickkopf 
homolog 1 (GenBank Accession Nos. AAG15544, BAA34651, NP_036374, AAF02674, 
AAD21087, and XP_005730), Danio rerio (zebrafish) dickkopfl (GenBank Accession Nos. 
BAA82135 and AAD22461) and murine dickkopf-1 (GenBank Accession Nos. 054908 and 
NP_034181). Variants and homologs of these sequences which possess Dkk-1 activity are 
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also included when referring to Dkk- 1 . 

By "LRP5 mediated", "LRP6 mediated", and "Dick mediated" disorder, condition or 
disease is any abnormal state that involves LRP5, LRP6 and/or Dkk activity. The abnormal 
state can be induced by environmental exposure or drug administration. Alternatively, the 
5 disease or disorder can be due to a genetic defect. Dkk mediated diseases, disorders and 

conditions include but are not limited to bone mass disorders or conditions and lipid disorders 
and conditions. For example, bone mass disorders/conditions/diseases, which may be 
mediated by Dkk, LRP5 and/or LRP6 include but are not limited to age related loss of bone, 
bone fractures (e.g., hip fracture, Colle's fracture, vertebral crush fractures), 
10 chondrodystrophies, drug-induced disorders (e.g., osteoporosis due to administration of 

glucocorticoids or heparin and osteomalacia due to administration of aluminum hydroxide, 
anticonvulsants, or glutethimide), high bone turnover, hypercalcemia, hyperostosis, 
osteoarthritis, osteogenesis imperfecta, osteomalacia, osteomyelitis, osteoporosis, Paget's 
disease, and rickets. 

15 Lipid disorders/diseases/conditions, which may be mediated by Dkk, LRP5, and/or 

LRP6 include but are not limited to familial lipoprotein lipase deficiency, familial apoprotein 
CII deficiency, familial type 3 hyperlipoproteinemia, familial hypercholesterolemia, familial 
hypertriglyceridemia, multiple lipoprotein-type hyperlipidemia, elevated lipid levels due to 
dialysis and/or diabetes, and elevated lipid levels of unknown etiologies 

20 The term "recognizes and binds," when used to define interactions of antisense 

nucleotides or siRNA's (small inhibitory RNA) with a target sequence, means that a particular 
antisense or small inhibitory RNA (siRNA) sequence is substantially complementary to the 
target sequence, and thus will specifically bind to a portion of an mRNA encoding 
polypeptide. As such, typically the sequences will be highly complementary to the mRNA 

25 target sequence, and will have no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 base mismatches 

throughout the sequence. In many instances, it may be desirable for the sequences to be exact 
matches, i.e. be completely complementary to the sequence to which the oligonucleotide 
specifically binds, and therefore have zero mismatches along the complementary stretch. As 
such, highly complementary sequences will typically bind quite specifically to the target 

30 sequence region of the mRNA and will therefore be highly efficient in reducing, and/or even 
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inhibiting the translation of the target mRNA sequence into polypeptide product. 

Substantially complementary oligonucleotide sequences will be greater than about 80 
percent complementary (or x % exact-match^) to the corresponding mRNA target sequence to 
which the oligonucleotide specifically binds, and will, more preferably be greater than about 
85 percent complementary to the corresponding mRNA target sequence to which the 
oligonucleotide specifically binds. In certain aspects, as described above, it will be desirable 
to have even more substantially complementary oligonucleotide sequences for use in the 
practice of the invention, and in such instances, the oligonucleotide sequences will be greater 
than about 90 percent complementary to the corresponding mRNA target sequence to which 
the oligonucleotide specifically binds, and may in certain embodiments be greater than about 
95 percent complementary to the corresponding mRNA target sequence to which the 
oligonucleotide specifically binds, and even up to and including 96%, 97%, 98%, 99%, and 
even 100% exact match complementary to the target mRNA to which the designed 
oligonucleotide specifically binds. 

Percent similarity or percent complementary of any of the disclosed sequences may be 
determined, for example, by comparing sequence information using the GAP computer 
program, version 6.0, available from the University of Wisconsin Genetics Computer Group 
(UWGCG). The GAP program utilizes the alignment method of Needleman and Wunsch 
(1970). Briefly, the GAP program defines similarity as the number of aligned symbols (i.e., 
nucleotides or amino acids) which are similar, divided by the total number of symbols in the 
shorter of the two sequences. The preferred default parameters for the GAP program include: 
(1) a unary comparison matrix (containing a value of 1 for identities and 0 for non-identities) 
for nucleotides, and the weighted comparison matrix of Gribskov and Burgess (1986), (2) a 
penalty of 3.0 for each gap and an additional 0.10 penalty for each symbol in each gap; and 
(3) no penalty for end gaps. 

By "mimetic" is meant a compound or molecule that performs the same function or 
behaves similarly to the compound mimicked. 

By "reporter element" is meant a polynucleotide that encodes a polypeptide capable of 
being detected in a screening assays. Examples of polypeptides encoded by reporter elements 
include, but are not limited to, lacZ, GFP, Iuciferase, and chloramphenicol acetyltransferase. 
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2. Introduction 

The present invention also encompasses the HBM gene and HBM protein in the forms 
indicated by SEQ ID NO: 2 and 4, respectively, and other closely related variants, as well as 
the adjacent chromosomal regions of the HBM gene necessary for its accurate expression. In 
5 a preferred embodiment, the present invention is directed to at least 15 contiguous 

nucleotides of the nucleic acid sequence of SEQ ID NO: 2. More preferably, the present 
invention is directed to at least 15 contiguous nucleotides of the nucleic acid sequence of 
SEQ ID NO: 2, wherein one of the 15 contiguous nucleotides is the thymine at nucleotide 
582. 

10 The invention also relates to the nucleotide sequence of the Zmaxl (LRPS) gene 

region, as well as the nucleotide sequence of the HBM region. More particularly, a preferred 
embodiment are the BAC clones containing segments of the Zmaxl (LRPS) gene region 
B200E21-H and B527D12-H. A preferred embodiment is the nucleotide sequence of the 
BAC clones consisting of SEQ ID NOS: 5-12. 

15 The invention also concerns the use of the nucleotide sequence to identify DNA 

probes for the Zmaxl (LRPS) gene and the HBM gene, PCR primers to amplify the Zmaxl 
(LRPS) gene and the HBM gene, nucleotide polymorphisms in the Zmaxl (LRPS) gene and 
the HBM gene, and regulatory elements of the Zmaxl (LRPS) gene and the HBM gene. 

This invention describes the further localization of the chromosomal location of the 

20 Zmaxl (LRPS) gene and HBM gene on chromosome 1 lql3.3, between genetic markers 

D11S987 and SNP_CONTIG033-6, as well as the DNA sequences of the Zmaxl (LRPS) gene 
and the HBM gene. The chromosomal location was refined by the addition of more genetic 
markers to the mapping panel used to map the gene, and by the extension of the pedigree to 
include more individuals. The pedigree extension was critical because the new individuals 

25 that have been genotyped harbor critical recombination events that narrow the region. To 
identify genes in the region on 1 lql3.3, a set of BAC clones containing this chromosomal 
region was identified. The BAC clones served as a template for genomic DNA sequencing, 
and also as a reagent for identifying coding sequences by direct cDNA selection. Genomic 
sequencing and direct cDNA selection were used to characterize more than 1.5 million base 

30 pairs of DNA from 1 lql3.3. The Zmaxl (LRPS) gene was identified within this region and 
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the HBMgene was then discovered after mutational analysis of affected and unaffected 
individuals. 

When a gene has been genetically localized to a specific chromosomal region, the 
genes in this region can be characterized at the molecular level by a series of steps that 
include: cloning of the entire region of DNA in a set of overlapping clones (physical 
mapping), characterization of genes encoded by these clones by a combination of direct 
cDNA selection, exon trapping and DNA sequencing (gene identification), and identification 
of mutations in these genes by comparative DNA sequencing of affected and unaffected 
members of the HBM kindred (mutation analysis). 

Physical mapping is accomplished by screening libraries of human DNA cloned in 
vectors that are propagated in E. coli or S. cereviseae using PCR assays designed to amplify 
unique molecular landmarks in the chromosomal region of interest. To generate a physical 
map of the HBM candidate region, a library of human DNA cloned in Bacterial Artificial 
Chromosomes (BACs) was screened with a set of Sequence Tagged Site (STS) markers that 
had been previously mapped to chromosome 1 Iql2-ql3 by the efforts of the Human Genome 
Project. 

STSs are unique molecular landmarks in the human genome that can be assayed by 
PCR. Through the combined efforts of the Human Genome Project, the location of thousands 
of STSs on the twenty-two autosomes and two sex chromosomes has been determined. For a 
positional cloning effort, the physical map is tied to the genetic map because the markers 
used for genetic mapping can also be used as STSs for physical mapping. By screening a 
BAC library with a combination of STSs derived from genetic markers, genes, and random 
DNA fragments, a physical map comprised of overlapping clones representing all of the 
DNA in a chromosomal region of interest can be assembled. 

BACs are cloning vectors for large (80 kilobase to 200 kilobase) segments of human . 
or other DNA that are propagated in E. coli. To construct a physical map using BACs, a 
library of BAC clones is screened so that individual clones harboring the DNA sequence 
corresponding to a given STS or set of STSs are identified. Throughout most of the human 
genome, the STS markers are spaced approximately 20 to 50 kilobases apart, so that an 
individual BAC clone typically contains at least two STS markers. In addition, the BAC 
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libraries that were screened contain enough cloned DNA to cover the human genome six 
times over. Therefore, an individual STS typically identifies more than one BAC clone. By 
screening a six-fold coverage BAC library with a series of STS markers spaced 
approximately 50 kilobases apart, a physical map consisting of a series of overlapping BAC 
5 clones, i.e. BAC contigs, can be assembled for any region of the human genome. This map is 
closely tied to the genetic map because many of the STS markers used to prepare the physical 
map are also genetic markers. 

When constructing a physical map, it often happens that there are gaps in the STS 
map of the genome that result in the inability to identify BAC clones that are overlapping in a 

10 given location. Typically, the physical map is first constructed from a set of STSs that have 
been identified through the publicly available literature and World Wide Web resources. The 
initial map consists of several separate BAC contigs that are separated by gaps of unknown 
molecular distance. To identify BAC clones that fill these gaps, it is necessary to develop 
new STS markers from the ends of the clones on either side of the gap. This is done by 

15 sequencing the terminal 200 to 300 base pairs of the BACs flanking the gap, and developing 
a PCR assay to amplify a sequence of 100 or more base pairs. If the terminal sequences are 
demonstrated to be unique within the human genome, then the new STS can be used to screen 
the BAC library to identify additional BACs that contain the DNA from the gap in the 
physical map. To assemble a BAC contig that covers a region the size of the HBM candidate 

20 region (2,000,000 or more base pairs), it is often necessary to develop new STS markers from 
the ends of several clones. 

After building a BAC contig, this set of overlapping clones serves as a template for 
identifying the genes encoded in the chromosomal region. Gene identification can be 
accomplished by many methods. Three methods are commonly used: (1) a set of BACs 

25 selected from the BAC contig to represent the entire chromosomal region can be sequenced, 
and computational methods can be used to identify all of the genes, (2) the BACs from the 
BAC contig can be used as a reagent to clone cDNAs corresponding to the genes encoded in 
the region by a method termed direct cDNA selection, or (3) the BACs from the BAC contig 
can be used to identify coding sequences by selecting for specific DNA sequence motifs in a 

30 procedure called exon trapping. The present invention includes genes identified by the first 
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two methods. 

To sequence the entire BAC contig representing the HBM candidate region, a set of 
BACs was chosen for subcloning into plasmid vectors and subsequent DNA sequencing of 
these subclones. Since the DNA cloned in the BACs represents genomic DNA, this 
5 sequencing is referred to as genomic sequencing to distinguish it from cDNA sequencing. To 
initiate the genomic sequencing for a chromosomal region of interest, several non- 
overlapping BAC clones are chosen. DNA for each BAC clone is prepared, and the clones 
are sheared into random small fragments which are subsequently cloned into standard 
plasmid vectors such as pUC18. The plasmid clones are then grown to propagate the smaller 
10 fragments, and these are the templates for sequencing. To ensure adequate coverage and 
sequence quality for the BAC DNA sequence, sufficient plasmid clones are sequenced to 
yield six-fold coverage of the BAC clone. For example, if the BAC is 100 kilobases long, 
then phagemids are sequenced to yield 600 kilobases of sequence. Since the BAC DNA was 
randomly sheared prior to cloning in the phagemid vector, the 600 kilobases of raw DNA 

15 sequence can be assembled by computational methods into overlapping DNA sequences 
termed sequence contigs. For the purposes of initial gene identification by computational 
methods, six-fold coverage of each BAC is sufficient to yield ten to twenty sequence contigs 
of 1000 base pairs to 20,000 base pairs. 

The sequencing strategy employed in this invention was to initially sequence "seed 11 

20 BACs from the BAC contig in the HBM candidate region. The sequence of the "seed" BACs 
was then used to identify minimally overlapping BACs from the contig, and these were 
subsequently sequenced. In this manner, the entire candidate region was sequenced, with 
several small sequence gaps left in each BAC. This sequence served as the template for 
computational gene identification. One method for computational gene identification is to 

25 compare the sequence of BAC contig to publicly available databases of cDNA and genomic 
sequences, e.g., unigene, dbEST, GenBank. These comparisons are typically done using the 
BLAST family of computer algorithms and programs (Altschul et al 9 J. Mol Biol, 215:403- 
410 (1990)). The BAC sequence can also be translated into protein sequence, and the protein 
sequence can be used to search publicly available protein databases, using a version of 

30 BLAST designed to analyze protein sequences (Altschul et al 9 1997 Nucl Acids Res. 25: 
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3389-3402). Another method is to use computer algorithms such as MZEF (Zhang, 1997 
Proc. Natl Acad. Set USA, 94: 565-568) and GRAIL (Uberbacher et aL, 1996 Methods 
Enzymol. 266: 259-281), which predict the location of exons in the sequence based on the 
presence of specific DNA sequence motifs that are common to all exons, as well as the 
presence of codon usage typical of human protein encoding sequences. 

In addition to identifying genes by computational methods, genes were also identified 
by direct cDNA selection (Del Mastro et aL, 1995 Genome Res. 5(2): 185-194). In direct 
cDNA selection, cDNA pools from tissues of interest are prepared, and the BACs from the 
candidate region are used in a liquid hybridization assay to capture cDNA which base-pairs to 
coding regions in the BAC. In the methods described herein, the cDNA pools were created 
from several different tissues by random priming the first strand cDNA from poly-A RNA, 
synthesizing the second strand cDNA by standard methods, and adding linkers to the ends of 
the cDNA fragments. The linkers are used to amplify the cDNA pools. The BAC clones are 
used as a template for in viti-o DNA synthesis to create a biotin labeled copy of the BAC 
DNA. The biotin labeled copy of the BAC DNA is then denatured and incubated with.an 
excess of the PCR amplified, linkered cDNA pools which have also been denatured. The 
BAC DNA and cDNA are allowed to anneal in solution, and heteroduplexes between the 
BAC and the cDNA are isolated using streptavidin coated magnetic beads. The cDNA which 
is captured by the BAC is then amplified using primers complimentary to the linker 
sequences, and the hybridization/selection process is repeated for a second round. After two 
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library of these 
direct selected fragments is created. 

The cDNA clones isolated by direct selection are analyzed by two methods. Since a 
pool of BACs from the HBM candidate region is used to provide the genomic DNA 
sequence, the cDNAs must be mapped to individual BACs. This is accomplished by arraying 
the BACs in microtiter dishes, and replicating their DNA in high density grids. Individual 
cDNA clones are then hybridized to the grid to confirm that they have sequence identity to an 
individual BAC from the set used for direct selection, and to determine the specific identity 
of that BAC. cDNA clones that are confirmed to correspond to individual BACs are 
sequenced. To determine whether the cDNA clones isolated by direct selection share 



35 



WO 02/092000 PCT/US02/14877 



sequence identity or similarity to previously identified genes, the DNA and protein coding 
sequences are compared to publicly available databases using the BLAST family of 
programs. 

The combination of genomic DNA sequence and cDNA sequence provided by BAC 
sequencing and by direct cDNA selection yields an initial list of putative genes in the region. 
The genes in the region were all candidates for the HBM locus. To further characterize each 
gene, Northern blots were performed to determine the size of the transcript corresponding to 
each gene, and to deteimine which putative exons were transcribed together to make an 
individual gene. For Northern blot analysis of each gene, probes were prepared from direct 
selected cDNA clones or by PCR amplifying specific fragments from genomic DNA or from 
the BAC encoding the putative gene of interest. The Northern blots gave information on the 
size of the transcript and the tissues in which it was expressed. For transcripts which were 
not highly expressed, it was sometimes necessary to perform a reverse transcription PCR 
assay using RNA from the tissues of interest as a template for the reaction. 

Gene identification by computational methods and by direct cDNA selection provides 
unique information about the genes in a region of a chromosome. When genes are identified, 
then it is possible to examine different individuals for mutations in each gene. 

The present invention also encompasses the HBM gene and HBM protein in the forms 
indicated by SEQ ID NO: 2 and 4, respectively, and other closely related variants, as well as 
the adjacent chromosomal regions of the HBM gene necessary for its accurate expression. In 
a preferred embodiment, the present invention is directed to an isolated nucleic acid sequence 
of SEQ ID NO: 2, as well as variants thereof. Variants include changes in SEQ ID NO:l 
which result in a HBM like phenotype. Examples of such variants are discussed further in 
Section 3 below and in the examples. These variants preferably have at least about 90%, 
preferably at least about 95%, or more preferably at least about 98% or more similarity or 
identity to the nucleic acid sequence of SEQ ID NOS: 1 or 2 or biologically active fragments 
thereof. Therefore, sequences which are 96%, 97%, and 99% or more similar to SEQ ID 
NOS: 1 or 2 or biologically active fragments thereof are also contemplated herein. 

Determination of the degree of variation between a high bone mass (HBM) variant 
can be performed using BLAST or FASTA or other suitable algorithm using standard default 
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parameters. Preferably, identity will be determined for coding regions of SEQ ID NOS: 1-2, 
but can also include non-coding domains. Additionally, alignment programs can be used to 
identify conserved sequences or potential motifs across different animal species. Alignment 
programs can also be used to align the nucleic acid and/or protein sequences of related genes 
5 and the proteins that they encode. Preferred alignment programs include CLUSTALW, 

PILEUP and GAP, and would preferably be used with default parameters. For example, such 
programs can be used to align the sequences of Zmaxl (LRP5), HBM, and LDL receptor- 
related protein 6 (LRP6) and sequences related thereto. 

By a polynucleotide having a nucleotide sequence at least, for example, 90% "similar" 

10 to a reference nucleotide sequence encoding a polypeptide, is intended that the nucleotide 
sequence of the polynucleotide is identical to the reference sequence except that the 
polynucleotide sequence may include up to ten point mutations per each 100 nucleotides of 
the reference nucleotide sequence. These mutations of the reference sequence may occur at 
any location in SEQ ID NO: 1 or 2 or in the LRP6 gene. The mutations may be silent. 

1 5 Another embodiment contemplates that such polynucleotide variants of SEQ ID NO: 

1 or 2 comprise nucleic acid sequences which are at least 15, 20, 25, 30, 35, 40, 45, 50, 60, 
70, 80, 90, 100, 150, 200, 300, 400, or 500 contiguous nucleotides of SEQ ID NO: 1 or 2. 
More preferably, such polynucleotide variants have a contiguous nucleic acid sequence 
corresponding with the polymorphism at nucleotide 582 (G-T substitution) of SEQ ED NO: 2 

20 or other variants of SEQ ID NO: 1 or 2, which comprise a mutation which modulates bone 
mass and/or lipid levels when the polypeptide encoded thereby is administered to a subject. 
All variants of SEQ ID NO: 1 or 2 contemplated possess the characteristic of encoding a 
protein or polypeptide which when administered to a subject induces bone modulation. 
Additional variants which may be responsible for modulating bone mass when administered 

25 to a subject may lie within the domain known to contain the HBM polymorphism and which 
encodes the beta propeller domain (YWTD motifs). Alternatively, other variants of Zmaxl 
(LRP5) which modulate high bone mass in a subject may be due to mutations in the nucleic 
acid sequences encoding any of the other conserved domains of Zmaxl (LRP5), such as those 
set forth in Figure 4 (e.g., the RGD extracellular attachment site, the binding site for LDL and 

30 calcium, the cysteine rich growth factor repeats, the ideal PEST region, and the 
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internalization domain). See Section 3 and the Examples below for additional mutations 
which may confer enhance bone mass and/or lipid modulation. 

HBM polynucleotides and HBM like variants contemplated include those which 
hybridize under stringent conditions to SEQ ID NO: 2. Hybridization methods are known in 
5 the art and include, but are not limited to: (a) washing with 0.1X SSPE (0.62 M NaCl, 0.06 M 
NaH 2 P0 4 -H 2 0, 0.075 M EDTA, pH 7.4) and 0.1% sodium dodecyl sulfate (SDS) at 50°C; (b) 
washing with 50% formamide, 5X SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM 
sodium phosphate (pH 6-8), 0.1% sodium pyrophosphate, 5X Denhardt's solution, sonicated 
salmon sperm DNA (50 Mg/ml), 0.1% SDS and 10% dextran sulfate at 42°C, followed by 

10 washing at 42°C in 0.2X SSC and 0.1% SDS; and (c) washing with of 0.5 M NaP0 4 , 7% 
SDS at 65°C followed by washing at 60°C in 0.5X SSC and 0.1% SDS. Additional 
conditions under which HBM variants can be isolated by hybridization to SEQ ID NO: 2 or 
nucleic acid fragments thereof can be performed by varying the hybridization temperature. 
High stringency hybridization conditions are those performed at about 20 °C below the 

1 5 melting temperature (T J of SEQ ID NO: 2 or fragments thereof. Preferred stringency is 

performed at about 5-10°C below the T m of SEQ ID NO: 2 or fragments thereof. Additional 
hybridization conditions can be prepared as described in Chapter 1 1 of Sambrook et ah ., 
Molecular Cloning: A Laboratory Manual (1989), or as would be known to the artisan of 
ordinary skill. 

20 Alternatively, mammalian libraries (e.g., equine, primate, caprine, bovine, ovine, 

feline, porcine, and canine) can be probed using degenerate primers and polymerase chain 
reaction (PCR) techniques to identify variants of SEQ ID NO: 2 or fragments thereof. 
Preferably primers are utilized which hybridize under stringent conditions to the open reading 
frame of SEQ ID NO: 2, or to non-coding portions of the sequence. More preferably, such 

25 primers hybridize to conserved domains within SEQ ID NO: 2. For example, conserved 

domains include those coding for the YWTD beta-propeller domains or other domains, such 
as those listed in Figure 4. Preferred primers are typically 15 nucleotides in length, but can 
vary to be at least, about 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40 or 50 (and any range 
in between) nucleotides in length. Heterologous hybridization is to amplify the target gene or 

30 nucleic acid sequence using degenerate PCR primers. Probes for variants of SEQ ID NO: 2 
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and the polypeptide encoded thereby can be obtained by preparing mixed oligonucleotides of 
greater than 10, preferably of 15 or more, nucleotides in length representing all possible 
nucleotide sequences which could encode the corresponding amino acid sequences (e.g., SEQ 
ID NO: 4 fragments thereof). This method is clearly documented by Gould et al, 1989 Proc. 
Natl Acad. Set USA 86(6): 1934-8. 

Another embodiment includes nucleic acids which encode an HBM polypeptide or 
HBM like variant which is at least about 90% similar to SEQ ID NO: 4 and fragments 
thereof, and which when administered to a subject modulate bone mass in that subject. HBM 
variants may have a valine corresponding to position 171 of SEQ ID NO: 4 (Gly to Val 
substitution) or 170 of the mouse homology or an amino acid change elsewhere in propeller 1 
or in the protein which results in enhanced bone mass and/or lipid modulation. Other 
preferred embodiments include high bone mass polypeptides which have at least about 10, 
15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 500 or more 
contiguous amino acids of SEQ ID NO: 3 or 4, with a mutation in the sequence resulting in. 
enhanced bone mass and/or modulated lipid levels. Such contemplated contiguous sequences 
preferably overlap with a polymorphism corresponding to high bone mass, such as valine- 174 
of SEQ ID NO: 4, other mutations in propeller 1 or is predicted from the model provided in : 
the Examples. Also contemplated are the polynucleotides encoding polypeptides which are at 
least about 95%, 96%, 97%, 98% and 99% or more similar to SEQ ID NO: 3 or 4 and 
fragments thereof, wherein these polypeptide contain at least one mutation (e.g., valine-171 
or like mutation). 

In another embodiment, a synthetic nucleic acid encoding SEQ ID NO: 4 is 
contemplated wherein the nucleic acid sequence has been conservatively substituted based on 
the degeneracy of the code such that no amino acids are altered in SEQ ID NO: 4, but perhaps 
wherein the resulting synthetic polynucleotide encoding said SEQ ID NO: 4 is one that is at 
least, about 50% similar to SEQ ID NO: 2. Alternatively, SEQ ID NO: 4 may contain any of 
the silent mutations identified in Section 3 below . 

By a polypeptide having an amino acid sequence of at least, for example, 95% 
"identity" to a reference amino acid sequence of SEQ ID NO: 3 or 4 or fragment thereof is 
intended that the amino acid sequence of the polypeptide is identical to the reference 
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sequence except that the polypeptide sequence may include up to five amino acid alterations 
per each 100 amino acids of the reference amino acid sequence of SEQ ID NO: 3 or 4 and has 
an HBM like phenotype. In other words, to obtain a polypeptide having an amino acid 
sequence 95% identical to a reference amino acid sequence, up to 5% of the amino acid 
5 residues in the reference sequence may be deleted or substituted with another amino acid, or a 
number of amino acids up to 5% of the total amino acid residues in the reference sequence 
may be inserted into the reference sequence. These alterations of the reference sequence may 
occur at the amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among residues 

1 0 in the reference sequence or in one or more contiguous groups within the reference sequence. 
Additional HBM polypeptides and nucleic acids which encode said HBM 
polypeptides are contemplated wherein amino acid residues are conservatively substituted. 
For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie et al, "Deciphering the Message in Protein Sequences: 

15 Tolerance to Amino Acid Substitutions," Science 247: 1306-10 (1990), wherein the authors 
indicate that there are two main approaches for studying the tolerance of an amino acid 
sequence to change. The first method relies on the process of evolution, in which mutations 
are either accepted or rejected by natural selection. The second approach uses genetic 
engineering to introduce amino acid changes at specific positions of a cloned gene and 

20 selections or screens to identify sequences that maintain functionality. These studies have 
revealed that proteins are surprisingly tolerant of amino acid substitutions. The authors 
further indicate which amino acid changes are likely to be permissive at a certain position of 
the protein. Numerous phenotypic substitutions are described in Bowie et aL, supra, and the 
references cited therein, which are herein incorporated by reference in their entirety. 

25 Preferred substitutions would be in domains which are less conserved across species, and 
which do not correspond to a structurally or functionally important domain (e.g., a binding 
site, catalytic site, or beta propeller or other domain described in Figure 4). 

Recent studies have indicated that LRP5 participates in the Wht signal transduction 
pathway. The Wnt pathway is critical in limb early embryological development. A recently 

30 published sketch of the components of Wnt signaling is shown in Fig. 28 (Nusse, 2001 
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http://www.stanford.edu/~musse/pathways/cell2. html) (see also, Nusse, 2001 Nature 411: 
255-6; and Mao et al, 2001 Nature 41 1: 321-5). 

Briefly summarized, Wnt proteins are secreted proteins which interact with the 
transmembrane protein Frizzled (Fz). LRP proteins, such as LRP5 and LRP6, are believed to 
5 modulate the Wnt signal in a complex with Fz (Tamai et al , 2000 Nature 407: 530-5). The 
Wnt pathway acts intracellularly through the Disheveled protein (Dsh) which in turn inhibits 
glycogen synthetase kinase-3 (GSK3) from phosphorylating P-catenin. Phosphorylated (3- 
catenin is rapidly degraded following ubiquitination. However, the stabilized p-catenin 
accumulates and translocates to the nucleus where it acts as a co factor of the T-cell factor 

1 0 (TCF) transcription activator complex. 

The protein dickkopf-1 (Dkk-1) is reported to be an antagonist of Wnt pathway. Dkk- 
1 is required for head formation in early development (Glinka et al, 1998 Nature 391: 357- 
62). Dkk-1 and its function in the Wnt pathway are described in e.g., Krupnik, et al, 1999 
Gene 238: 301-13; Fedi et al, 1999 J. Biol Chem. 21 A: 19465-72; see also for Dkk-1 and.the 

15 Wnt pathway, Wu et al, 2000 Curr. Biol 10: 1611-4; Shinya et al, 2000 Meek Dev. 98: 3- 
17; Mukhopadhyay et al, 2001 Dev. Cell 1: 423-434; and in PCT Patent Application No: ' 
WO 00/52047, and in references cited in each. It has been known that Dkk-1 acts upstream 
of Dsh, however the nature of the mechanism of inhibition by Dkk-1 is just beginning to be 
elucidated. Dkk-1 is expressed in the mouse embryonic limb bud and its disruption results in 

20 abnormal limb morphogensis, among other developmental defects (Gotewold et al, 1999 
Mech. Dev. 89:151-3; and Mukhopadhyay etal, 2001 Dev. Cell 1: 423-34). 

Related U.S. Serial No. 60/291,31 1 (herein incorporated by reference in its entirety) 
disclosed a novel interaction between Dkk-1 (GenBank Accession No. XM 005730) and 
LRP5. The interaction between Dkk-1 and LRP5 was discovered by a yeast two hybrid 

25 (Y2H) screen for proteins which interact with the ligand binding domain of LRP5. The two- 
hybrid screen is a common procedure in the art, which is described, for example, by Gietz et 
al, 1997 Mol. Cell Biochem. 172: 67-79; Young, 1998 Biol Reprod. 58: 302-11; Brent et 
al, 1997 Ann. Rev. Genet. 31 : 663-704; and Lu et al, eds., V^ast Hybrid Technologies , 
Eaton Publishing, Natick MA, (2000). More recently, other studies confirm that Dkk-1 is a 

30 binding partner for LRP and modulates the Wnt pathway via direct binding with LRP (R. 
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Nusse, 2001 Nature 411: 255-6; Bafico etal, 2001 Nat. Cell Biol. 3: 683-6; Semenov, 2001 
Curr. Biol. 11: 951-61; Mao, 2001 Nature 411: 321-5 (2001); Zorn, 2001 Curr. Biol 11: 
R592-5); and Li et al, 2002 J. Biol Chem. 277: 5977-81). 

Mao and colleagues (2001) identified Dkk-1 as a ligand for LRP6. Mao et al suggest 
5 that Dkk-1 and LRP6 interact antagonistically where Dklc proteins inhibit the Wnt coreceptor 
functions of LRP6. Using co-immunoprecipitation, the group verified that the Dkk-1/LRP6 
interaction was direct. Dkk-2 was also found to directly bind LRP6. Contrary to data 
contained in provisional application 60/291,311, Mao et al report that no interaction was 
detected between any Dkk protein and LRP5, as well as no interaction with LDLR, VLDLR, 

10 ApoER, or LRP). Additionally, Mao et al demonstrated that LRP6 can titrate Dkk-l's 

effects of inhibiting Wnt signaling using the commercial TCF-luciferase reporter gene assay 
(TOPFLASH). A similar conclusion was drawn from analogous studies in Xenopus embryos. 
Deletion analyses of LRP6 functional domains revealed that EGF repeats (beta-propellers) 3 
and 4 were necessary for Dkk-1 binding and that the ligand binding domains of LRP6 had no 

15 effect on Dkk-1 binding. The findings of Mao et al contrast with data obtained by the 

present inventors indication that the ligand binding domains of LRP5 were necessary and 
sufficient for Dkk-1 binding in yeast. Using classical biochemical ligand-receptor studies, 
Mao et al determined a Kd=0.34 nM for Dkk-1 /LRP 6 and a Kd-0.73 nM for Dkk-2/LRP6. 
Semenov et al (2001) verified the Mao group's results and confirmed by 

20 coimmunoprecipitation that Dkk-1 does not directly bind to Wnt or Frizzled but rather 
interacts with LRP6. Their Scatchard analyses found a Kd=0.5 nM for Dkk-1 /LRP 6. 
Semenov et al also demonstrated that Dkk-1 could abolish an LRP5/Frizzled8 complex 
implying that Dkk-1 can also repress Wnt signaling via interactions with LRP5. A Dkk-1 
mutant where cysteine 220 was changed to alanine abolished LRP6 binding and was unable 

25 to repress Wnt signaling. Studies in Xenopus embryos confirmed the results and revealed a 
functional consequence of Dkk-1 /LRP6: repression of Wnt signaling. Their Xenopus work 
also suggested that LRP6/Dkk-1 may be specific for the canonical, p-catenin-mediated, Wnt 
pathways as opposed to the Wnt Planar Cell Polarity pathway. 

Bafico et al (2001) employed a 125 I-labeled Dkk-1 molecule to identify LRP6 as its 

30 sole membrane receptor with a Kd=0.39 nM. Again, the functional consequences of the Dkk- 
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1/LRP6 interaction was a repression of the canonical Wnt signaling even when Dkk-1 was 
added at extremely low concentrations (30 pM). 

Not wishing to be bound by theory, it is believed that the present invention provides 
an explanation for the mechanism of Dkk-1 inhibition of the Wnt pathway and provides a 
5 mechanism whereby the Wnt pathway may be modulated. The present application and 

related U.S. Serial No. 60/291,3 1 1 (which is herein incorporated by reference in its entirety) 
describe Dkk-1 /LRP5 interactions and demonstrate that the interaction between 
LRP5/LRP6/HBM, and Dkk can be used in a method as an intervention point in the Wnt 
pathway for an anabolic bone therapeutic or a modulator of lipid metabolism. 

10 As detailed in the Examples, Dkk-1 is able to repress LRP5-mediated Wnt signaling 

but not HBM-mediated Wnt signaling. This observation is of particular interest because the 
HBM mutation in LRP5 is a gain of function or activation mutation. That is, Wnt signaling, 
via the canonical pathway, is enhanced with HBM versus LRP5. Thus, other HBM-like 
mutations would also function similarly. The present data suggest the mechanism of this 

15 functional activation: the inability of Dkk-1 to repress HBM-mediated Wnt signaling. 

Further investigations of other Wnt or Dkk family members show differential activities in-the 
canonical Wnt pathway that demonstrate the complexity and variability in Wnt signaling that 
can be achieved depending on the LRP/Dkk/Wnt/Frizzled repertoire that is expressed in a. 
particular cell or tissue. This may attest to the apparent bone specificity of the HBM or HBM 

20 like phenotypes in humans and in the HBM transgenic animals. 

Furthermore, the present data reveal the importance and functional consequence for 
the potential structural perturbation of the first beta-propeller domain of LRP5. Our data 
identified the ligand binding domain of LRP5 as the interacting region with Dkk-1 while the 
Mao et ah publication demonstrated the functional role of propellers 3 and 4 in their 

25 LRP6/Dkk-1 studies. In the present invention, we implicate the first beta propeller domain, 
via the HBM mutation at residue 171, as having a functional consequence in the Dkk-1- 
mediated Wnt pathway. The involvement of position 171 of propeller 1 may be direct or 
indirect with Dkk-1. Direct involvement could arise from perturbations of the 3-dimensional 
structure of the HBM extracellular domain that render Dkk-1 unable to bind. Alternatively, 

30 residue 171 of propeller 1 may directly interact with Dkk-1; however, by itself, it is 
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insufficient to bind and requires other LRP5 domains. Potential indirect candidate molecules 
may be among the proteins identified the Dkk-1 yeast-two-hybrid experiments. 

It may be that the disruption of Dkk activity is not necessarily mediated by enhancing 
or preventing the binding of Dkk to LRP5/LRP6/HBM. More than one mechanism may be 
5 involved. Indeed, the inventors have observed that Dkk-1 binds LRP5, LRP6, and HBM. It 
is able to effectively inhibit LRP6, and to a slightly lesser extent, LRP5 activity. Further, has 
been observed that different members of the Dkk family differentially affect 
LRP5/LRP6/HBM activity. For example, Dkk-1 inhibits LRP5/LRP6/HBM activity while 
another Dkk may enhance LRP5/LRP6/HBM activity. An endpoint to consider is the 

1 0 modulation of the LRP5/LRP6/HBM activity, not simply binding. 

The present disclosure shows that targeting the modulation of the Dkk-1/LRP5 
interaction is a therapeutic intervention point for an HBM or HBM-like mimetic agent. A 
therapeutic agent of the invention may be a small molecule, peptide or nucleic acid aptamer, 
antibody, or other peptide/protein, etc. Methods of reducing Dkk-1 expression may also be 

15 therapeutic using methodologies such as: RNA interference (i.e., siRNAs), small hairpin 

RNAs (shRNAs), antisense oligonucleotides, morpholino oligonucleotides, PNAs, antibodies 
to Dkk-1 or Dkk-1 interacting proteins, decoy or scavenger LRP5 or LRP6 receptors, and 
knockdown of Dkk-1 or Dkk-1 interactor transcription. For discussion of small hairpin 
RNAs, see Yu et al. 9 2002 Proc. Natl. Acad. Set USA 99: 6047-52; Tuschl, 2002 Nature 

10 Biotech. 20: 446-8; Lee et aL, 2002 Nature Biotech. 19: 500-5; Paddison et aL, 2002 Genes 
& Devel 16: 948-58; and Brummelkamp et aL, 2002 Science 296: 550-3. 

In an embodiment of the present invention, the activity of Dkk-1 or the activity of a 
Dkk-1 interacting protein may be modulated for example by binding with a peptide aptamer 
of the present invention. In another embodiment, LRP5 activity may be modulated by a 

15 reagent provided by the present invention (e.g., a peptide aptamer). In another embodiment, 
the Dkk-1/LRP5 interaction may be modulated by a reagent of the present invention (e.g., a 
Dkk-1 interacting protein such as those identified in Fig. 3 1). In another embodiment, the 
Wnt signal transduction pathway may be modulated by use of one or more of the above 
methods. In a preferred embodiment of the present invention, the Dkk-1 mediated activity of 

JO the Wnt pathway may be specifically modulated by one or more of the above methods. In 
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another preferred embodiment of the present invention, the Wnt signal transduction pathway 
may be stimulated by down-regulating Dkk-1 interacting protein activity; such down- 
regulation could, for example, yield greater LRP5 activity. In a more preferred embodiment, 
by stimulating LRP5 activity, bone mass regulation may be stimulated to restore or maintain 
5 a more optimal level. In another preferred embodiment, by stimulating LRP5 activity, lipid 
metabolism may be stimulated to restore or maintain a more optimal level. Alternative 
embodiments provide methods for screening candidate drugs and therapies directed to 
correction of bone mass disorders or lipid metabolism disorders. And, preferred 
embodiments of the present invention provide drugs and therapies developed by the use of 

! 0 the reagents and/or methods of the present invention. One skilled in the art will understand 
that the present invention provides important research tools to develop an effective model of 
osteoporosis, to increase understanding of bone mass and lipid modulation, and to modulate 
bone mass and lipid metabolism. For a more detailed description of Dkk-1 and Dkk-1 
interacting protein modulation, please refer to U.S. Serial No. 60/361,293, which is herein 

1 5 incorporated by reference in its entirety. 

3. Alternati ve Va riants of LRP5/LRP 6 H aving HBM Activity 

A structural model of the LRP5/Zmaxl first beta-propeller module was generated 
based on a model prediction in Springer et ah, 1998 J. Mol Biol 283: 837-862. Based on the 

!0 model, certain amino acid residues were identified as important variants of 

LRP5/HBM/Zmaxl . The model and modifications thereof are discussed in more detail in the 
Examples. The following three categories provide examples of such variants: 

The shape of the beta-propeller resembles a disk with inward-sloping sides and a hole 
down the middle. Residue 171 is in a loop on the outer or top surface of the domain in blade 

15 4 of propeller module 1. Thus, variants comprising changed residues in structurally 

equivalent positions in other blades; as well as residues that are slightly more interior to the 
binding pocket, but still accessible to the surface, are important embodiments of the present 
invention for the study of bone mass modulation by LRP5/HBM, for the development of 
pharmaceuticals and treatments of bone mass disorders, and for other objectives of the 

JO present invention. The following Table contains examples of such variants: 
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TABLE 1 

Effect nfMntatinn 

a position equivalent to 171 in blade 5 of propeller 1; alanine is not conserved 
in other propellers 

a position equivalent to 171 in blade 3 of propeller 1; glutamate is not 
conserved in other propellers 

a position equivalent to 171 in blade 2 of propeller 1; alanine is conserved in 
propellers 1-3 but not 4 

an accessible interior position in blade 5 of propeller 1; glycine is conserved in 
propellers 1-3 but not 4 

accessible interior position in blade 1 of propeller 1; methionine is conserved 
in propellers 1-3 but not 4 

These mutations were further analyzed based on a more sophisticated model, as discussed 
and described in Example 1 1 below. 

LRP5/Zmaxl has four beta-propeller structures; the first three beta-propeller modules 
conserve a glycine in the position corresponding to residue 171 in human LRP5/Zmaxl. 
Therefore, variants bearing a valine in the equivalent positions in the other propellers are 
important embodiments of the present invention. The following variants are useful for the 
study of bone mass modulation by LRP5/HBM, for the development of pharmaceuticals and 
treatments of bone mass disorders, and for other objectives of the present invention: G479V, 
G781V, and Q1087V of SEQ ID NO: 3, which demonstrate that propeller 1 is an important 
determinant of an HBM or HBM-like effect. 

The G171 V HBM polymorphism (SEQ ID NO: 4) results in "occupied space" of the 
beta-propeller 1, with the side-chain from the valine residue sticking out into an open binding 
pocket and potentially altering a ligand/protein interaction. The glycine residue is conserved 
in LRP5/Zmaxl propellers 1, 2 and 3 but is a glutamine in propeller 4. Therefore, the 
following variants of LRP5/HBM are important embodiments of the present invention for the 
study of bone mass modulation by LRP5/HBM, for the development of pharmaceuticals and 
treatments of bone mass disorders, and for other objectives of the present invention: 

G171K: introduces a charged side-chain 
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G171F: introduces a ringed side-chain 
G171I: introduces a branched side-chain 
G171Q: introduces the propeller 4 residue 
These substitutions along with substitutions in other regions of propeller 1 of SEQ ID NO: 3 
5 (i.e., A214V and M282V) have been shown to produce an HBM-like effect by TCF assay 

(Figs. 29 and 30). Thus these substitutions in other propeller 1 domains would similarly have 
an expectation of producing an HBM-like phenotype readily assayable by TCF assay. 

Furthermore, LRP6 is the closest homolog of LRP5/Zmaxl. LRP6 has a beta- 
propeller structure predicted to be similar, if not identical to Zmaxl . The position 
L0 corresponding to glycine 171 of human LRP5/Zmaxl is glycine 158 of human LRP6. Thus, 
corresponding variants of LRP6 are an important embodiment of the present invention for the 
study of the specificity of LRP5/Zmaxl versus its related family member, for the 
development of pharmaceuticals and treatments of bone mass disorders, and for other 
objectives of the present invention. Specifically, for example, a glycine to valine substitution 
15 at the structurally equivalent position, residue 158, of human LRP6 and similar variants of 
other species' LRP6 homologs represent important research tools. 

Site-directed mutants of LRP5 were generated in the full-length human LRP5 cDNA 
using the QuikChange XL-Site-Directed Mutagenesis Kit (catalog #200516, Stratagene, La 
Jolla, CA) following the manufacturer's protocol. The mutant sequences were introduced 
20 using complementary synthetic oligonucleotides: 

Mutation Complementary oligos 

A65V: S'-TGGTCAGCGGCCTGGAGGATGTGGCCGCAGTGGACTTCC-S 1 
S'-GGAAGTCCACTGCGGCCACATCCTCCAGGCCGCTGACCA-S 1 

E128V 5 AAGCTGTACTGGACGGACTC AGTGACC AACCGC ATCG AGG-3 ' 
S'-CCTCGATGCGGTTGGTCACTGAGTCCGTCCAGTACAGCTT-S 1 

G171K 5 ATGTACTGGAC AG ACTGG AAGGAG ACGCCCCGG ATTGAGCG-3 1 
5 -CGCTC AATCCGGGGCGTCTCCTTCCAGTCTGTCC AGTACAT-3 1 

IS G171F 5-ATGTACTGGACAGACTGGTTTGAGACGCCCCGGATTGAGCG-3' 

5 ' -CGCTC AATCCGGGGCGTCTCAAACCAGTCTGTCCAGTACAT-3 ' 

Gl 711 5 , -ATGTACTGGACAGACTGGATTGAGACGCCCCGGATTGAGCG-3 , 
S'-CGCTCAATCCGGGGCGTCTCAATCCAGTCTGTCCAGTACAT-S 1 
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Mutation Complementary nlig ns 

G171Q 5-ATGTACTGGACAGACTGGCAGGAGACGCCCCGGATTGAGCG-3' 5'- 
CGCTCAATCCGGGGCGTCTCCTGCCAGTCTGTCCAGTACAT-3' 

Gl 99V 5'-CGGACATTTACTGGCCCAATGTACTGACCATCGACCTGGAGG-3' 
5'-CCTCCAGGTCGATGGTCAGTACATTGGGCCAGTAAATGTCCG-3' 

A214V 5-AGCTCTACTGGGCTGACGTCAAGCTCAGCTTCATCCACCG-3' 
5'-CGGTGGATGAAGCTGAGCTTGACGTCAGCCCAGTAGAGCT-3' 

M282V 5'-GAGTGCCCTCTACTCACCCGTGGACATCCAGGTGCTGAGCC-3' 
5'-GGCTCAGCACCTGGATGTCCACGGGTGAGTAGAGGGCACTC-3' 

G479V 5'-CATGTACTGGACAGACTGGGTAGAGAACCCTAAAATCGAGTGTGC-3' 
5'-GCACACTCGATTTTAGGGTTCTCTACCCAGTCTGTCCAGTACATG-3' 

G781V 5-CATCTACTGGACCGAGTGGGTCGGCAAGCCGAGGATCGTGCG-3 1 
5'-CGCACGATCCTCGGCTTGCCGACCCACTCGGTCCAGTAGATG-3* 

Ql 087V 5-GTACTTCACCAACATGGTGGACCGGGCAGCCAAGATCGAACG-3' 
5'-CGTTCGATCTTGGCTGCCCGGTCCACCATGTTGGTGAAGTAC-3' 

G158V of 5'-GTACTGGACAGACTGGGTAGAAGTGCCAAAGATAGAACGTGC-3' 
LRP6 5'-GCACGTTCTATCTTTGGCACTTCTACCCAGTCTGTCCAGTAC-3' 



All constructs were sequence verified to ensure that only the engineered modification 
was present in the gene. Once verified, each variant was functionally evaluated in the TCF- 
luciferase assay in U20S cells (essentially as described in Example 6. Other functional 
evaluations could also be performed, such as the Xenopus embryo assay (essentially as 
described in Example 5), or other assays to evaluate Wnt signaling, Dkk modulation, or 
anabolic bone effect. Binding of these mutants to Dkk, LRP-interacting proteins, Dkk- 
interacting proteins, or peptide aptamers to any of the preceding could also be investigated in 
a variety of ways such as in a two-hybrid system (such as in yeast as described in this 
application), or other methods. 

Figure 29 shows the effects of the G171F mutation in propeller 1 of LRP5. This 
mutation is at the same position as HBM's G171V substitution. Expression of G171F results 
in an HBM effect. That is, in the presence of Wnt, G171F is able to activate the TCF- 
luciferase reporter construct. In fact, it may activate the reporter to a greater extent than 
either LRP5 or HBM. Furthermore, in the presence of Dkkl and Wntl, G171F is less 
susceptible than LRP5 to modulation by Dkk. These data exemplify that the G171F variant 
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modulates Wnt signaling in a manner similar to HBM. In addition, this data confirms that 
HBM's valine residue at 171 is not the only modification at 171 that can result in an HBM 
effect. Together these data support an important role for LRP5 propeller 1 in modulating 
Wnt pathway activity; in responding to Dkk modulation; and, in the ability to generate an 
5 HBM effect. 

Figure 30 shows the effects of the M282V mutation in propeller 1 of LRP5. M282 
expression results in an HBM-effect. That is, in the presence of Wnt, M282 is able to 
activate the TCF-luciferase reporter construct. Furthermore, in the presence of Dkkl and 
Wntl, M282V is less susceptible than LRP5 to modulation by Dkk. These data show that the 
10 M282V variant modulates Wnt signaling in a manner similar to HBM. In addition, this data 
confirms that modifications of other residues in propeller 1 of LRP5 can result in an HBM 
effect. 

These data support an "occupied space" model of the HBM mutation in propeller 1 
and show that multiple mutations of propeller 1 are capable of generating an HBM effect; the 

15 original G171V HBM mutation is not unique in this ability. Moreover, various perturbations 
in propeller 1 can modulate Dkk activity. 

These data illustrate the molecular mechanism of Dkk modulation of LRP signaling. 
Using the methods disclosed herein and in U.S. Application 60/290,071, generation of a 
comprehensive mutant panel will reveal residues in LRP that function in Dkk modulation of 

20 Wnt signaling. Such variants of LRP5 and LRP6 that modulate Dkk activity and the 

residues which distinguish them from LRP5 and LRP6 are points for therapeutic intervention 
by small molecule compound, antibody, peptide aptamer, or other agents. Furthermore, 
models of each HBM-effect mutation/polymorphism may be used in rational drug design of 
an HBM mimetic agent. 

25 These and the examples provided infra are only a few illustrative examples presented 

to better describe the present invention. Variants of LRP5 which have demonstrated HBM 
activity in assays include A65V, G171I, G171V (HBM), G171F, M282V, G171K, G171Q 
and A214V. Clearly, other variants may be contemplated within the scope of the present 
invention. Furthermore, wherever HBM is recited in the methods of the invention, it should 

30 be understood that any such alternative variant of LRP 5 or LRP6 which demonstrates HBM 
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like biological activity is also encompassed by those claims. 

Additional mutations may also result in conformational changes such as those 
described above and in the Examples below. These mutations may also result in a HBM-like 
phenotype when expressed. The following mutations have been identified in Table 2 based 
5 on familial genetics in the exons of Zmaxl (LRP5). 

TABLE 2: Mutations in the Exons of LRP5 



Exon Location Reference Change Met as AA 
(ATG as nt 1) AA1 Change 



.0 



2 


249 


CGAGGAGGCCATC 


r/T 




None 


2 


266 


AGACCTACCT 


A/G 






3 


512 


GTGAGACGCC 


G/T 


VJ 1 / 1 


\jr-* v 


6 


1199 


CGTACCTGGACGGG 


r/T 


Aidon 

Jr\*T\J\J 


V 


Q 
o 




LAJULr L L Ls ACvjC 1 (jC 


C/T 


F549 


None 


9 


1932 


GGCCTTCTTGGTCT 


G/A 


E644 


None 


9 


1999 


GTGGCCATCCCGCT 


G/A 


V667 


V-M 


10 


2220 


GAGAATCGAAGTG 


C/T 


N740 


None 


14 


3107 


GGACACTGTT 


G/A 


R1036 


R-Q. 


15 


3297 


CGGCACCGAGCG 


C/T 


D1099 


None 


15 


3357 


AGACAACACACTG 


A/G 


V1119 


None 


16 


3564 


GACTCGCATC 


G/A 


R1188 


None 


18 


3989 


CGGACTGTGA 


C/T 


A1330 


A-V 


20 


4137 


CAGCCCGGCCCAC 


C/T 


D1379 


None 


20 


4248 


GGGGGCCAACGG 


G/A 


A1416 


None 


20 


4565 


CGGCCACTGC 


C/T 


PI 522 


P-L 


23 


4635 


CGACGTGTGTGACA 


C/T 


T1545 


None 



15 

By, for example "C/T" is meant there is a C to T mutation. 

Additional mutations have also been identified in the introns, which may result in 
splice variants are provided in Table 3 below. 
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TABLE 3: Mutations in the Introns of LRP5 





Exon 


Nucleotide Position 
witii respect to iLxon 


Sequence with SNP Location 
u naeruneu 


Nucleotide 




z 


ICO 
TJJ 




P/T 
C/ 1 




D 




tp a ptpp apa pTpn 


T/P 
1/C 


D 


D 


+o 




Pi/ A 




n 
1 


-DU 


np AnAPA pp APAP 


P/ A 




Q 

o 


lie 


tpptpttpppp a tt 


1/VJ 




Q 


1 Q 1 


TPPnnPTP A PTPPT 
JLvJvjrOUrLr 1 vjAvj 1 1 


T/P 
1/C 




i n 
1U 


"TO 


jSj 111 CjL^U 1 kj 1 L/LL 


T/P 
1/C 


1 A 


i 1 


1 11 

-1 /J 


A 1 vj 1 Lr 1 Lr 1 ooC Avj 


A /PI 




1 1 
1 1 


1 CO 

-1DZ 


fPTPTPPPPPTTP 


i a 
vj/A 




1 1 
1 1 


Ad 


PPPTP APA PP APAf 

LiCjLrl IxACjrALrUACjrAC 


P/T 

U/l 




1 1 
1 1 


-L'J'7 

+3 / 


i^vjLrO<jCAijrv^CoVjfLr 


P/T 

U/l 




1 1 
1 1 


+ /o 


PT A PPPTPTPP,PPT 

ll 1 ALLL 1 o 1 vjvjL^ 1 


PI/ A 
vjr/A 


1 < 
1 J 


1Z 


J.QA 

+5U 


PTP A TPTHPinriTTP 

L^lCAieiurvjiaVJl 1L 


p/p 




1 o 
1Z 


J_1 A 1 

+141 


A TP A TPPT A PPTPP 

A 1 UA 1 ut 1 ALL 1 kjvj 


A /P 




ID 


1 

-loo 


r*r*rzr~i a atttppapa 


P/T 
W 1 




1 C 
ID 




TTPTTP A A PT A PT A 

_L1 CjI 1 LAAL 1 J\kj 1 A 


T/P 
1/C 




ID 


-jZ 


TPPP A PP APA PPP 

1 U L> Lr A Vjrta A vjr AC uL 


T/P 
1 /VJT 


op* 

ZU 


i o* 


-Z13 


TTPTTTPPPLT'P A Tf* 

JLICjI 1 lCL/(jrVjrCAlC 


T/P 
i/C 




17 


-82 


CATTTGGCCCCTA 


C/T 




18 


-72 


GCCCAGTCAC 


G/A 




18 


-63 


CGCCATTGCC 


C/T 




18 


-30 


GTGTGATGTT 


G/A 


25 


18 


+23 


IGATCTGGAGGAGG 


T/C 




18 


+47 


GTCTGGGCAGCTTT 


G/C 




18 


+54 


CACCGTCAGTGCT 


C/A 
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Exon 



Nucleotide Position 
with respect to Exon 



Sequence with SNP Location 
Underlined 



Nucleotide 



22 



-118 



GGCACCTGCC 



G/A 



These splice variants may also produce an altered phenotype capable of conferring a 
HBM-like effect. 

These mutations were identified using 80 ng of genomic DNA, which was PCR 
amplified with the primers indicated in Table 4 below with M13F 
(TGTAAAACGACGGCCAGT) attached to the 5' end of the Forward primer or M13R 
(AGGAAACAGCTATGACCAT) attached to the 5 1 end of the Reverse primer. 4 /A of the 
PCR reaction was diluted to a final volume of 100 iA with water. Sequencing was performed 
using ET and M13F and M13R primers or by the ABI-PRISM® Big-Dye™ method of 
Applied Biosystems using the indicated nested sequencing primers. The sequences are 
assembled on consed with the appropriate reference sequence. 

The PCR mixes used are as follows 

Promega: 50 mM KC1, 10 mM Tris-HCl, pH 9.0, 0.1% Triton X-100, 1.5 mM 



Invitrogen D: 60 mM Tris-HCl, pH 8.5, 15 mM (NH 4 ) 2 S0 4 , 3.5 mM MgCl 2 
Invitrogen J: 60 mM Tris-HCL, pH 9.5, 1 5 mM (NH 4 ) 2 S0 4 , 2.0 mM MgCl 2 

InvitrogenM: 60 mM Tris-HCl, pH 10.0, 15 mM (NH 4 ) 2 S0 4 , 1.5 mM MgCl 2 
To all the PCT mixes, 120 /uM of all 4 dNTPs was added, 0.4 /uM of the forward and reverse 
primers, 80 ng genomic DNA, 1 U of AmpliTaq® DNA polymerase, and 1 . 1 U of TaqStart 
antibody (Clontech). The PCR reaction is than run as follows: 94 °C, 2 min; (94°C, 30 sec; 
X-anneal-temp, 30 sec; 72°C, 2 min) for 35 cycles and 72°C for 3 min. 



MgCl 2 
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£ 

"O 

c 



UJ 

< 



2 .ES 



i i 



u 
a. 

ai 



SI 



SI 



p 



S! 



p 



(3 



P 



SI 



SI 



S 



u 
p 



s 

s 



s 



31 



ID 

I 

P 
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4. GenotV Pinf* of Microsatellite Markers 

To narrow the genetic interval to a region smaller than that originally reported by 
Johnson et al., 1997 Am. J. Hum. Genet, 60:1326-1332, additional microsatellite markers on 
chromosome llql2-13 were typed. The new markers included: D11S4191, D11S1883, 
5 DllS1785,DHS4113,DllS4136,DllS4139,(Dib^a/. 5 1996 Nature, 380:152-154), FGF3 
(Polymeropolous eta!., 1990 Nucl Acid Res., 18: 7468), as well as GTC JHBM_Marker_l , 
GTC_HBM_MarkerJ2, GTC_HBM__Marker__3, GTC_HBMJVlarker_4 5 
GTC HBM _Marker_5 , GTCJHBM_Marker__6, and GTC__HBMJVIarker_7 (See Fig. 2). 

Blood (20 ml) was drawn into lavender cap (EDTA containing) tubes by a certified 
0 phlebotomist. The blood was stored refrigerated until DNA extraction. DNA has been 
extracted from blood stored for up to 7 days in the refrigerator without reduction in the 
quality or quantity of yield. For those subjects that have blood drawn at distant sites, a 
shipping protocol was successfully used on more than a dozen occasions. Blood samples 
were shipped by overnight express in a styrofoam container with freezer packs to provide 
5 cooling. Lavender cap tubes were placed on individual plastic shipping tubes and then into 
"zip-lock" biohazard bags. When the samples arrived the next day, they were immediately 
processed to extract DNA. 

The DNA extraction procedure used a kit purchased from Gentra Systems, Inc. 
(Minneapolis, Minnesota). Briefly, the procedure involved adding 3 volumes of a red blood 
3 cell lysis buffer to the whole blood. After incubations for 10 minutes at room temperature, 
the solution was centrifuged in a Beckman tabletop centrifuge at 2,000 X g for 10 minutes. 
The white blood cell pellet was resuspended in Cell Lysis Buffer. Once the pellet was 
completely resuspended and free of cell clumps, the solution was digested with RNase A for 
15 minutes at 37°C. Proteins were precipitated by addition of the provided Protein 
> Precipitation Solution and removed by centrifugation. The DNA was precipitated out of the 
supernatant by addition of isopropanol. This method was simple and fast, requiring only 1-2 
hours, and allowed for the processing of dozens of samples simultaneously. The yield of 
DNA was routinely >8 mg for a 20 ml sample of whole blood and had a MW of >50 kb. 
DNA was archived by storing coded 50 jug aliquots at -80°C as an ethanol precipitate. 
) DNA was genotyped using one fluorescently labeled oligonucleotide primer and one 
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unlabeled oligonucleotide primer. Labeled and unlabeled oligonucleotides were obtained 
from Integrated DNA Technologies, Inc. (Coralville, Iowa). All other reagents for 
microsatellite genotyping were purchased from Perkin Elmer- Applied Biosystems, Inc. ("PE- 
ABI") (Norwalk, Connecticut). Individual PGR reactions were performed for each marker, as 
5 described by PE-ABI using AmpliTaq™ DNA Polymerase. The reactions were added to 3.5 
jil of loading buffer containing deionized formamide, blue dextran and TAMRA 350 size 
standards (PE-ABI). After heating at 95 °C for 5 minutes to denature the DNA, the samples 
were loaded and electrophoresed as described in the operator's manual for the Model 377 
DNA Sequencer (PE-ABI, Foster City, California). After gel electrophoresis, the data was 

10 analyzed using PE-ABI GENES CAN™ and GENOTYPER™ software. First, within the 

GENESCAN™ software, the lane tracking was manually optimized prior to the first step of 
analysis. After the gel lane data was extracted, the standard curve profiles of each lane were 
examined and verified for linearity and size calling. Lanes, which had problems with either 
of these parameters, were re-tracked and verified. Once all lanes were tracked and the size 

15 standards were correctly identified, the data were imported into GENOTYPER™ for allele 
identification To expedite allele calling (binning), the program Linkage Designer from the 
Internet web-site of Dr. Guy Van Camp (http://alt.www.xiia.ac.be/u/dnalab/ld.html) was used. 
This program greatly facilitates the importing of data generated by GENOTYPER™ into the 
pedigree drawing program Cyrillic (Version 2.0, Cherwell Scientific Publishing Limited, 

20 Oxford, Great Britain) and subsequent linkage analysis using the program LINKAGE 
(Lathrop etal, 1985 Am. J. Hum. Genet 37: 482-98). 

5. Linkage Analysis 

Fig. 1 demonstrates the pedigree of the individuals used in the genetic linkage studies 
25 for this invention. Specifically, two-point linkage analysis was performed using the MLHSfK 
and LDSfKMAP components of the program LINKAGE (Lathrop et al 9 1985 Am. J. Hum. 
Genet., 37: 482-98). Pedigree/marker data was exported from Cyrillic as a pre-file into the 
Makeped program and converted into a suitable ped-file for linkage analysis. 

The original linkage analysis was performed using three models: (i) an autosomal 
30 dominant, fully penetrant model, (ii) an autosomal dominant model with reduced penetrance, 
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and (iii) a quantitative trait model. The HBM locus was mapped to chromosome 1 lql2-13 
by analyzing DNA for linked markers from 22 members of a large, extended kindred. A 
highly automated technology was used with a panel of 345 fluorescent markers which 
spanned the 22 autosomes at a spacing interval ranging from 6-22 cM. Only markers from 
this region of chromosome 1 1 showed evidence of linkage (LOD score -3.0). The highest 
LOD score (5.74) obtained by two-point and multipoint analysis was Dl 1S987 (map position 
55 in Fig. 2). The 95% confidence interval placed the HBM locus between markers Dl 1S905 
and Dl 1S937 (map position 41-71 in Fig. 2). Haplotype analysis also places the Zmaxl 
(LRP5) gene in this same region. Further descriptions of the markers Dl 1S987, Dl 1S905, 
and Dl 1S937 can be found in Gyapay et al, 1994 Nature Genetics, Vol. 7. 

In this invention, the inventors report the narrowing of the HBM interval to the region 
between markers Dl 1S987 and GTC_HBM_Marker_5. These two markers lie between the 
delimiting markers from the original analysis (Dl 1S905 and Dl 1S937) and are 
approximately 3 cM from one another. The narrowing of the interval was accomplished 
using genotypic data from the markers D11S4191, D11S1883, D11S1785, D11S4113, 
Dl 1S4136, Dl 1S4139, (Dib et al., 1996 Nature 380: 152-4), FGF3 (Polymeropolous et al, 
1990 Nucl. Acid Res., 18: 7468) (information about the genetic markers can be found at the 
Internet site of the Genome Database, http://gdbwww.gdb.org/), as well as the markers 
GTC_HBM_Marker_l, GTC_HBM_Marker_2, GTC_HBM_Marker_3, 
GTC_HBM_Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and 
GTC_HBM_Marker_7. 

As shown in Fig. 1, haplotype analysis with the above genetic markers identifies 
recombination events (crossovers) in individuals 9019 and 9020 that significantly refine the 
interval of chromosome 1 1 to which the Zmaxl (LRP5) gene is localized. Individual 9019 is 
an HBM-affected individual that inherits a portion of chromosome 1 1 from the maternal 
chromosome with the HBM gene, and a portion from the chromosome 1 1 homologue. The 
portion inherited from the HBM gene-carrying chromosome includes markers D11S935, 
D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296, GTC_HBM_Marker_6, 
GTC_HBM_Marker_2, Dl 1S970, GTC__HBM_Marker_3, Dl 1S41 13, 
GTC_HBM_Marker_ 1 , GTC_HBM__Marker_7 and GTC_HBM_Marker_5 . The portion 
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from Dl 1S4136 and continuing in the telomeric direction is derived from the non-HBM 
chromosome. This data places the Zmaxl (LRP5) gene in a location centromeric to the 
marker GTC_HBM_Marker_5 . Individual 9020 is an unaffected individual who also exhibits 
a critical recombination event. This individual inherits a recombinant paternal chromosome 
5 11 that includes markers D11S935, D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296 
and GTC_HBM_Marker_6 from her father's (individual 0115) chromosome 1 1 homologue 
that carries the HBM gene, and markers GTC_HBM_Marker_2 5 D 1 1 S970, 
GTC_HBM JMarker_3 , GTC_HBM_Marker_l, GTC_HBM Marker 7 , 
GTC_HBM_Marker_5,DllS4136,DllS4139,DllS1314, and D11S937 from her father's 

10 chromosome 1 1 that does not carry the /fflMgene. Marker Dl 1S41 13 is uninformative due 
to its homozygous nature in individual 01 15. This recombination event places the 
centromeric boundary of the HBM region between markers Dl IS 1296 and Dl 1S987. 

Two-point linkage analysis was also used to confirm the location of the Zmaxl 
(LRP5) gene on chromosome 1 1 . The linkage results for two point linkage analysis under a 

15 model of full penetrance are presented in Table 5 below. This table lists the genetic markers 
in the first column and the recombination fractions across the top of the table. Each cell of 
the column shows the LOD score for an individual marker tested for linkage to the Zmaxl 
(LRP5) gene at the recombination fraction shown in the first row. For example, the peak 
LOD score of 7.66 occurs at marker Dl 1S970, which is within the interval defined by 

20 haplotype analysis. 
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TABLE 5 



Marker 


0.0 


0.05 


0.1 


0.15 


0.2 


0.25 


0.3 


0.35 


0.4 


D11S935 


- infinity 


0.39 


0.49 


0.47 


0.41 


0.33 


0.25 


0.17 


0.10 


D11S1313 


- infinity 


2.64 


2.86 


2.80 


2.59 


2.30 


1.93 


1.49 


1.00 


D11S987 


- infinity 


5.49 


5.18 


4.70 


4.13 


3.49 


2.79 


2.03 


1.26 


D11S4113 


4.35 


3.99 


3.62 


3.24 


2.83 


2.40 


1.94 


1.46 


0.97 


D11S1337 


2.29 


2.06 


1.81 


1.55 


1.27 


0.99 


0.70 


0.42 


0.18 


D11S970 


7.66 


6.99 


6.29 


5.56 


4.79 


3.99 


3.15 


2.30 


1.44 


D11S4136 


6.34 


5.79 


5.22 


4.61 


3.98 


3.30 


2.59 


1.85 


1.11 


D11S4139 


6.80 


6.28 


5.73 


5.13 


4.50 


3.84 


3.13 


2.38 


1.59 


FGF3 


0.59 


3.23 


3.15 


2.91 


2.61 


2.25 


1.84 


1.40 


0.92 


D11S1314 


6.96 


6.49 


5.94 


5.34 


4.69 


4.01 


3.27 


2.49 


1.67 


D11S937 


-infinity 


4.98 


4.86 


4.52 


4.06 


3.51 


2.88 


2.20 


1.47 



A single nucleotide polymorphism (SNP) further defines the HBM region. This SNP 
is termed SNP_Contig033-6 and is located 25 kb centromeric to the genetic marker 
GTC_HBM_Marker_5 . This SNP is telomeric to the genetic marker GTC_HBM_Marker_7. 
SNP_Contig033-6 is present in HBM-affected individual 01 13. However, the HBM-affected 
individual 9019, who is the son of 0113, does not carry this SNP. Therefore, this indicates 
that the crossover is centromeric to this SNP. The primer sequence for the genetic markers 
GTC_HBM_Marker_5 and GTC_HBM_Marker_7 is shown in Table 6 below. 



35 TABLE 6 



Marker 


Primer (Forward) 


Primer (Reverse) 


GTC_HBM_ 
Marker 5 


1T1TGGGTACACAATTCAGTCG 
(SEQ ID NO:63) 


AAAACTGTGGGTGCTTCTGG 
(SEQ ID NO:65) 


GTCJHBM_ 
Marker 7 


GTGATTGAGCCAATCCTGAGA 
(SEQ ID NO:64) 


TGAGCCAAATAAACCCCTTCT 
(SEQ ID NO:66) 



The kindred described have several features of great interest, notably that their bones, 
while very dense, have an absolutely normal shape. The outer dimensions of the skeletons of 
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the HBM-affected individuals are normal, and, while medullary cavities are present, there is 
no interference with hematopoiesis. The HBM-affected members seem to be resistant to 
fracture, and there are no neurologic symptoms, and no symptoms of impairment of any 
organ or system function in the members examined. HBM-affected members of the kindred 
5 live to advanced age without undue illness or disability. Furthermore, the HBM phenotype 
matches no other bone disorders such as osteoporosis, osteoporosis pseudoglioma, 
Engelmann's disease, Ribbing's disease, hyperphosphatasemia, Van Buchem's disease, 
melorheostosis, osteopetrosis, pycnodysostosis, sclerostenosis, osteopoikilosis, acromegaly, 
Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis imperfecta, 

10 hypoparathyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroidism, primary 
and secondary hyperparathyroidism and associated syndromes, hypercalciuria, medullary 
carcinoma of the thyroid gland, osteomalacia and other diseases. Clearly, the HBM locus in 
this family has a very powerful and substantial role in regulating bone density, and its 
identification is an important step in understanding the pathway(s) that regulate bone density 

15 and the pathogenesis of diseases such as osteoporosis. 

In addition, older individuals carrying the HBM gene, and therefore expression of the 
HBM protein, do not show loss of bone mass characteristic of normal individuals. In other 
words, the HBM gene is a suppressor of osteoporosis. In essence, individuals carrying the 
HBM gene are dosed with the HBM protein, and, as a result, do not develop osteoporosis. 

20 This in vivo observation is strong evidence that treatment of normal individuals with the 
HBM gene or protein, or a fragment thereof, will ameliorate osteoporosis. 

6. Physical Mapping 

To provide reagents for the cloning and characterization of the HBM locus, the 
25 genetic mapping data described above were used to construct a physical map of the region 

containing Zmaxl (LRP5) on chromosome llql3.3. The physical map consists of an ordered 
set of molecular landmarks, and a set of BAC clones that contain the Zmaxl (LRP5) gene 
region from chromosome llql3.3. 

Various publicly available mapping resources were utilized to identify existing STS 
30 markers (Olson et aL, 1989 Science 245: 1434-5) in the HBM region. Resources included the 
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GDB, the Whitehead Institute Genome Center, dbSTS and dbEST (NCBI), 1 ldb, the 
University of Texas Southwestern GESTEC, the Stanford Human Genome Center, and 
several literature references (Courseaux etal, 1997 Genomics 40: 13-23; Courseaux et al, 
1996 Genomics 37: 354-65; Guru et aL, 1997 Genomics 42: 436-45; Hosoda et al 9 1997 
Genes Cells 2: 345-57; James et al, 1994 Nat Genet 8: 70-76; Kitamura et al % 1997 DNA 
Research, 4: 281-9; Lemmens et al, 1997 Genomics 44: 94-100; and Smith et al. 9 1997 
Genome Res. 7: 835-42). Maps were integrated manually to identify markers mapping to the 
region containing Zmaxl (LRP5). 

Primers for existing STSs were obtained from the GDB or literature references are 
listed in Table 6 below. Thus, Table 7 shows the STS markers used to prepare the physical 
map of the Zmaxl (LRP5) gene region. 
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Novel STSs were developed either from publicly available genomic sequence or from 
sequence-derived BAC insert ends. Primers were chosen using a script which automatically 
performs vector and repetitive sequence masking using Cross_match (P. Green, Univ. of 
Washington) and subsequent primer picking using Primer3 (Rozen, Skaletsky (1996, 1997). 
5 Primer3 is available at: www.genome.wi.mit.edii/genome_software/other/primer3 .html. 

Polymerase chain reaction (PCR) conditions for each primer pair were initially 
optimized with respect to MgCl 2 concentration. The standard buffer was 10 mM Tris-HCl 
(pH 8.3), 50 mM KC1, MgCl 2 , 0.2 mM each dNTP, 0.2 \iM each primer, 2.7 ng/^il human 
DNA, 0.25 units of AmpliTaq (Perkin Elmer) and MgCl 2 concentrations of 1.0 mM, 1.5 mM, 
10 2.0 mM or 2.4 mM. Cycling conditions included an initial denaturation at 94°C for 2 

minutes followed by 40 cycles at 94°C for 15 seconds, 55 °C for 25 seconds, and 72°C for 25 
seconds followed by a final extension at 72 °C for 3 minutes. Depending on the results from 
the initial round of optimization the conditions were further optimized if necessary. 
Variables included increasing the annealing temperature to 58 °C or 60 °C, increasing the 

15 cycle number to 42 and the annealing and extension times to 30 seconds, and using 
AmpliTaq™ Gold (Perkin Elmer). 

BAC clones (Kim et al 9 1996 Genomics 32: 213-8; and Shizuya et al, 1992 Proa 
Natl. Acad. Set USA 89: 8794-7) containing STS markers of interest were obtained by PCR- 
based screening of DNA pools from a total human BAC library purchased from Research 

20 Genetics. DNA pools derived from library plates 1-596 were used corresponding to nine 

genomic equivalents of human DNA. The initial screening process involved PCR reactions 
of individual markers against superpools, i.e., a mixture of DNA derived from all BAC clones 
from eight 384-well library plates. For each positive superpool, plate (8), row (16) and 
column (24), pools were screened to identify a unique library address. PCR products were 

25 electrophoresed in 2% agarose gels (Sigma) containing 0.5 ng/ml ethidium bromide in IX 
TBE at 150 volts for 45 min. The electrophoresis units used were the Model A3-1 systems 
from Owl Scientific Products. Typically, gels contained 10 tiers of lanes with 50 wells/tier. 
Molecular weight markers (100 bp ladder, Life Technologies, Bethesda, MD) were loaded at 
both ends of the gel. Images of the gels were captured with a Kodak DC40 CCD camera and 

30 processed with Kodak ID software. The gel data were exported as tab delimited text files; 
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names of the files included information about the library screened, the gel image files and the 
marker screened. These data were automatically imported using a customized Perl script into 
Filemaker™ PRO (Claris Corp.) databases for data storage and analysis. In cases where 
incomplete or ambiguous clone address information was obtained, additional experiments 
5 were performed to recover a unique, complete library address. 

Recovery of clonal BAC cultures from the library involved streaking out a sample 
from the library well onto LB agar (Maniatis et aL, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982)) containing 12.5 
|Ag/ml chloramphenicol (Sigma). Two individual colonies and a portion of the initial streak 
10 quadrant were tested with appropriate STS markers by colony PCR for verification. Positive 
clones were stored in LB broth containing 12.5 ^ig/ml chloramphenicol and 15% glycerol at - 
70°C. 

Several different types of DNA preparation methods were used for isolation of BAC 
DNA. The manual alkaline lysis miniprep protocol listed below (Maniatis et aL, 1982) was 

15 successfully used for most applications, i.e., restriction mapping, CHEF gel analysis, FISH 

mapping, but was not successfully reproducible by endsequencing. The Autogen and Qiagen 
protocols were used specifically for BAC DNA preparation for endsequencing purposes. 

Bacteria were grown in 15 ml Terrific Broth containing 12.5 fig/ml chloramphenicol 
in a 50 ml conical tube at 37 °C for 20 hrs with shaking at 300 rpm. The cultures were 

20 centrifuged in a Sorvall RT 6000 D at 3000 rpm (~1800xg) at 4°C for 15 min. The 

supernatant was then aspirated as completely as possible. In some cases, cell pellets were 
frozen at -20 °C at this step for up to 2 weeks. The pellet was then vortexed to homogenize 
the cells and minimize clumping. 250 jil of PI solution (50 mM glucose, 15 mM Tris-HCl, 
pH 8, 10 mM EDTA, and 100 ng/ml RNase A) was added and the mixture pipetted up and 

25 down to mix. The mixture was then transferred to a 2 ml Eppendorf tube. 350 ^1 of P2 

solution (0.2 N NaOH, 1% SDS) was then added, the mixture mixed gently and incubated for 
5 min at room temperature. 350 |il of P3 solution (3 M KOAc, pH 5.5) was added and the 
mixture mixed gently until a white precipitate formed. The solution was incubated on ice for 
5 min and then centrifuged at 4°C in a microfuge for 10 min. The supernatant was 

30 transferred carefully (avoiding the white precipitate) to a fresh 2 ml Eppendorf tube, and 0.9 

67 



BNSDOCID: <WO 02092000 A2J_> 



WO 02/092000 



PCT/US02/14877 



ml of isopropanol was added, the solution mixed and left on ice for 5 min. The samples were 
centrifuged for 10 min, and the supernatant removed carefully. Pellets were washed in 70% 
ethanol and air dried for 5 min. Pellets were resuspended in 200 ^il of TE8 (10 mM Tris-HCl, 
pH 8.0, 1.0 mM EDTA), and RNase A (Boehringer Mannheim) added to 100 jig/ml. 
Samples were incubated at 37°C for 30 min, then precipitated by addition of C 2 H 3 0 2 Na-3H 2 0 
to 0.5 M and 2 volumes of ethanol. Samples were centrifuged for 10 min, and the pellets 
washed with 70% ethanol followed by air drying and dissolving in 50 fA TE8. Typical yields 
for this DNA prep were 3-5 /^g/15 ml bacterial culture. Ten to 15 jjl were used for HindlH 
restriction analysis; 5 jxl was used for Notl digestion and clone insert sizing by CHEF gel 
electrophoresis. 

BACs were inoculated into 15 ml of 2X LB Broth containing 12.5 ixg/ml 
chloramphenicol in a 50 ml conical tube. Four tubes were inoculated for each clone. 
Cultures were grown overnight (-16 hr) at 37°C with vigorous shaking (>300 rpm). 
Standard conditions for B AC DNA isolation were followed as recommended by the Autogen 
740 manufacturer. 3 ml samples of culture were placed into Autogen tubes for a total of 60 
ml or 20 tubes per clone. Samples were dissolved finally in 100 jxl TE8 with 15 seconds of 
shaking as part of the Autogen protocol. After the Autogen protocol was finished DNA 
solutions were transferred from each individual tube and pooled into a 2 ml Eppendorf tube. 
Tubes with large amounts of debris (carry over from the pelleting debris step) were avoided. 
The tubes were then rinsed with 0.5 ml of TE8 successively and this solution added to the 
pooled material. DNA solutions were stored at 4°C; clumping tended to occur upon freezing 
at -20 °C. This DNA was either used directly for restriction mapping, CHEF gel analysis or 
FISH mapping or was further purified as described below for use in endsequencing reactions. 

The volume of DNA solutions was adjusted to 2 ml with TE8, samples were then 
mixed gently and heated at 65 °C for 10 min. The DNA solutions were then centrifuged at 
4°C for 5 min and the supernatants transferred to a 15 ml conical tube. The NaCl 
concentration was then adjusted to 0.75 M (-0.3 ml of 5 M NaCl to the 2 ml sample). The 
total volume was then adjusted to 6 ml with Qiagen column equilibration buffer (Buffer 
QBT). The supernatant containing the DNA was then applied to the column and allowed to 
enter by gravity flow. Columns were washed twice with 10 ml of Qiagen Buffer QC. Bound 
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DNA was then eluted with four separate 1 ml aliquots of Buffer QF kept at 65 °C. DNA was 
precipitated with 0.7 volumes of isopropanol (-2.8 ml). Each sample was then transferred to 
4 individual 2.2 ml Eppendorf tubes and incubated at room temperature for 2 hr or overnight. 
Samples were centrifuged in a microfuge for 10 min. at 4°C. The supernatant was removed 
5 carefully and 1 ml of 70% ethanol was added. Samples were centrifuged again and because 
the DNA pellets were often loose at this stage, the supernatant removed carefully. Samples 
were centrifuged again to concentrate remaining liquid which was removed with a micropipet 
tip. DNA pellets were then dried in a desiccator for 10 min. 20 |il of sterile distilled and 
deionized H z O was added to each tube which was then placed at 4°C overnight. The four 20 

10 \il samples for each clone were pooled and the tubes rinsed with another 20 jil of sterile 

distilled and deionized H 2 0 for a final volume of 100 jxl. Samples were then heated at 65 °C 
for 5 min. and then mixed gently. Typical yields were 2-5 jig/60 ml culture as assessed by 
Notl digestion and comparison with uncut lambda DNA. 

3 ml of LB Broth containing 12.5 jig/ml of chloramphenicol was dispensed into - 

15 autoclaved Autogen tubes. A single tube was used for each clone. For inoculation, glycerol 
stocks were removed from -70 °C storage and placed on dry ice. A small portion of the 
glycerol stock was removed from the original tube with a sterile toothpick and transferred •+ 
into the Autogen tube; the toothpick was left in the Autogen tube for at least two minutes - 
before discarding. After inoculation the tubes were covered with tape making sure the seal 

20 was tight. When all samples were inoculated, the tube units were transferred into an Autogen 
rack holder and placed into a rotary shaker at 37°C for 16-17 hours at 250 rpm. Following 
growth, standard conditions for BAC DNA preparation, as defined by the manufacturer, were 
used to program the Autogen. Samples were not dissolved in TE8 as part of the program and 
DNA pellets were left dry. When the program was complete, the tubes were removed from 

25 the output tray and 30 jil of sterile distilled and deionized H 2 0 was added directly to the 

bottom of the tube. The tubes were then gently shaken for 2-5 seconds and then covered with 
parafilm and incubated at room temperature for 1-3 hours. DNA samples were then 
transferred to an Eppendorf tube and used either directly for sequencing or stored at 4°C for 
later use. 

30 

69 

BNSDOCID: <WO 02092000 A2_l_> 



WO 02/092000 PCT7US02/14877 



7. BAC Clone Characterization for Physica l Mappin g 

DNA samples prepared either by manual alkaline lysis or the Autogen protocol were 
digested with HindUI for analysis of restriction fragment sizes. This data were used to 
compare the extent of overlap among clones. Typically 1-2 \ig were used for each reaction. 
5 Reaction mixtures included: IX Buffer 2 (New England Biolabs), 0.1 mg/ml bovine serum 

albumin (New England Biolabs), 50 (xg/ml RNase A (Boehringer Mannheim), and 20 units of 
HindlU (New England Biolabs) in a final volume of 25 jil. Digestions were incubated at 
37 °C for 4-6 hours. BAC DNA was also digested with NotI for estimation of insert size by 
CHEF gel analysis (see below). Reaction conditions were identical to those for HindUI 

10 except that 20 units of Notl were used. Six jxl of 6X Ficoll loading buffer containing 
bromphenol blue and xylene cyanol was added prior to electrophoresis. 

HincHU digests were analyzed on 0.6% agarose (Seakem, FMC Bioproducts) in IX 
TBE containing 0.5 fig/ml ethidium bromide. Gels (20 cm X 25 cm) were electrophoresed in 
a Model A4 electrophoresis unit (Owl Scientific) at 50 volts for 20-24 hrs. Molecular weight 

1 5 size markers included undigested lambda DNA 5 HindUI digested lambda DNA, and HaeUI 
digested _X174 DNA. Molecular weight markers were heated at 65 °C for 2 min prior to 
loading the gel. Images were captured with a Kodak DC40 CCD camera and analyzed with 
Kodak ID software. 

Notl digests were analyzed on a CHEF DRII (BioRad) electrophoresis unit according 
20 to the manufacturer's recommendations. Briefly, 1% agarose gels (BioRad pulsed field 

grade) were prepared in 0.5X TBE, equilibrated for 30 minutes in the electrophoresis unit at 
14 °C, and electrophoresed at 6 volts/cm for 14 hrs with circulation. Switching times were 
ramped from 10 sec to 20 sec. Gels were stained after electrophoresis in 0.5 p.g/ml ethidium 
bromide. Molecular weight markers included undigested lambda DNA, HindM digested 
25 lambda DNA, lambda ladder PFG ladder, and low range PFG marker (all from New England 
Biolabs). 

BAC DNA prepared either by the manual alkaline lysis or Autogen protocols were 
labeled for FISH analysis using a Bioprime labeling kit (BioRad) according to the 
manufacturer's recommendation with minor modifications. Approximately 200 ng of DNA 
30 was used for each 50 jjtl reaction. 3 |xl were analyzed on a 2% agarose gel to determine the 
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extent of labeling. Reactions were purified using a Sephadex G50 spin column prior to in 
situ hybridization. Metaphase FISH was performed as described (Ma et al., 1996 Cytogenet 
Cell Genet, 74: 266-71). 

5 8. BAC Endsequencing 

The sequencing of BAC insert ends utilized DNA prepared by either of the two 
methods described above. The DYEnamic energy transfer primers and Dynamic Direct cycle 
sequencing kits from Amersham were used for sequencing reactions. Ready made 
sequencing mix including the Ml 3 -40 forward sequencing primer was used (Catalog # 

10 US79730) for the T7 BAC vector terminus; ready made sequencing mix (Catalog # 

US79530) was mixed with the M13 -28 reverse sequencing primer (Catalog # US79339) for 
the SP6 BAC vector terminus. The sequencing reaction mixes included one of the four 
fluorescently labeled dye-primers, one of the four dideoxy termination mixes, dNTPs, 
reaction buffer, and Thermosequenase. For each BAC DNA sample, 3 p.1 of the BAC DNA 

15 sample was aliquoted to 4 PCR strip tubes. 2 |il of one of the four dye primer/termination 
mix combinations was then added to each of the four tubes. The tubes were then sealed and 
centrifuged briefly prior to PCR. Thermocycling conditions involved a 1 minute denaturation 
at 95 °C, 15 second annealing at 45 °C, and extension for 1 minute at 70°C for 35 total cycles. 
After cycling the plates were centrifuged briefly to collect all the liquid to the bottom of the 

20 tubes. 5 (xl of sterile distilled and deionized H 2 0 was then added into each tube, the plates 
sealed and centrifuged briefly again. The four samples for each BAC were then pooled 
together. DNA was then precipitated by adding 1.5 jil of 7.5 M NH 4 OAc and 100 \i\ of - 
20°C 100% ethanol to each tube. Samples were mixed by pipetting up and down once. The 
plates were then sealed and incubated on ice for 10 minutes. Plates were centrifuged in a 

25 table topHaraeus centrifuge at 4000 rpm (3,290xg) for 30 minutes at 4°C to recover the 
DNA. The supernatant was removed and excess liquid blotted onto paper towels. Pellets 
were washed by adding 100 \x\ of -20 °C 70% ethanol into each tube and re-centrifuging at 
4000 rpm (3,290xg) for 10 minutes at 4°C. The supernatant was removed and excess liquid 
again removed by blotting on a paper towel. Remaining traces of liquid were removed by 

30 placing the plates upside down over a paper towel and centrifiiging only until the centrifuge 
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reached 800 rpm. Samples were then air dried at room temperature for 30 min. Tubes were 
capped and stored dry at -20 °C until electrophoresis. Immediately prior to electrophoresis 
theDNA was dissolved in 1.5 jil of Amersham loading dye. Plates were then sealed and 
centrifuged at 2000 rpm (825Xg). The plates were then vortexed on a plate shaker for 1-2 
5 minutes. Samples were then recentrifuged at 2000 rpm (825Xg) briefly. Samples were then 
heated at 65 °C for 2 min and immediately placed on ice. Standard gel electrophoresis was 
performed on ABI 377 fluorescent sequencers according to the manufacturer's 
recommendation. 

10 9. Sub-cloning and Seq uencin g nfHRM RAf DNA 

The physical map of the Zmaxl (LRP5) gene region provides a set of BAC clones that 
contain within them the Zmaxl (LRP5) gene and the HBM gene. DNA sequencing of several 
of the BACs from the region has been completed. The DNA sequence data is a unique 
reagent that includes data that one skilled in the art can use to identify the Zmaxl (LRP5) 
1 5 gene and the HBM gene, or to prepare probes to identify the gene(s), or to identify DNA 
sequence polymorphisms that identify the gene(s). 

BAC DNA was isolated according to one of two protocols, either a Qiagen 
purification of BAC DNA (Qiagen, Inc. as described in the product literature) or a manual 
purification which is a modification of the standard alkaline lysis/Cesium Chloride 

20 preparation of plasmid DNA (see e.g., Ausubel et al, Current Protocols in Molecular 

Biology, John Wiley & Sons (1997)). Briefly for the manual protocol, cells were pelleted, 
resuspended in GTE (50 mM glucose, 25 mM Tris-Cl (pH 8), 10 mM EDTA) and lysozyme 
(50 mg/ml solution), followed by NaOH/SDS (1% SDS/0.2 N NaOH) and then an ice-cold 
solution of 3 M KOAc (pH 4.5-4.8). RNaseA was added to the filtered supernatant, followed 

25 by Proteinase K and 20% SDS. The DNA was then precipitated with isopropanol, dried and 
resuspended in TE (10 mM Tris, 1 mM EDTA (pH 8.0)). The BAC DNA was further 
purified by Cesium Chloride density gradient centrifugation (Ausubel et al, 1997). 

Following isolation, the BAC DNA was sheared hydrodynamically using an HPLC 
(Hengen, 1997 Trends in Biochem. Set 22: 273-4) to an insert size of 2000-3000 bp. After 

30 shearing, the DNA was concentrated and separated on a standard 1% agarose gel. A single 

72 



BNSDOCID: <WO 02092000A2_I_> 



WO 02/092000 PCT/US02/14877 



fraction, corresponding to the approximate size, was excised from the gel and purified by 
electroelution (Sambrook et ah, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring, NY (1989)). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The 
5 blunt-ended DNA was then ligated to unique AstfXI-linker adapters (5'- GTCTTCACCACG 
GGG-3' and 5'-GTGGTGAAGAC-3' in 100-1000 fold molar excess; SEQ ID NOS: 627 and 
628 respectively). These linkers were complimentary to the BstXL-cut pMPX vectors 
(constructed by the inventors), while the overhang was not self-complimentary. Therefore, 
the linkers would not concatemerize nor would the cut- vector religate itself easily. The 

10 linker-adapted inserts were separated from the unincorporated linkers on a 1% agarose gel 
and purified using GeneClean (BIO 101, Inc.). The linker-adapted insert was then ligated to 
a modified pBlueScript vector to construct a "shotgun" subclone library. The vector 
contained an out-of-frame lacZ gene at the cloning site which became in-frame in the event 
that an adapter-dimer is cloned, allowing these to be avoided by their blue-color. 

15 All subsequent steps were based on sequencing by ABI377 automated DNA 

sequencing methods. Only major modifications to the protocols are highlighted. Briefly, the . 
library was then transformed into DH5a competent cells (Life Technologies, Bethesda, MD, 
DH5a transformation protocol). It was assessed by plating onto antibiotic plates containing 
ampicillin and IPTG/Xgal. The plates were incubated overnight at 37 °C. Successful 

20 transformants were then used for plating of clones and picking for sequencing. The cultures 
were grown overnight at 37°. DNA was purified using a silica bead DNA preparation (Ng et 
aL, 1996 Nucl Acids Res. 24: 5045-7) method. In this manner, 25 p.g of DNA was obtained 
per clone. 

These purified DNA samples were then sequenced using ABI dye-terminator 
25 chemistry. The ABI dye terminator sequence reads were run on ABI377 machines and the 

data was directly transferred to UNIX machines following lane tracking of the gels. All reads 
were assembled using PHRAP (P. Green, Abstracts of DOE Human Genome Program 
Contractor-Grantee Workshop V, Jan. 1996, p. 157) with default parameters and quality 
scores. The initial assembly was done at 6-fold coverage and yielded an average of 8-15 
30 contigs. Following the initial assembly, missing mates (sequences from clones that only 
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gave one strand reads) were identified and sequenced with ABI technology to allow the 
identification of additional overlapping contigs. Primers for walking were selected using a 
Genome Therapeutics program Pick_primer near the ends of the clones to facilitate gap 
closure. These walks were sequenced using the selected clones and primers. Data were 
reassembled with PHRAP into sequence contigs. 

10. Gene Identification hy Computat ional Methods 

Following assembly of the BAC sequences into contigs, the contigs were subjected to 
computational analyses to identify coding regions and regions bearing DNA sequence 
similarity to known genes. This protocol included the following steps. 

1 . Degap the contigs: the sequence contigs often contain symbols 
(denoted by a period symbol) that represent locations where the individual ABI 
sequence reads have insertions or deletions. Prior to automated computational 
analysis of the contigs, the periods were removed. The original data was maintained 
for future reference. 

2. BAC vector sequences were "masked" within the sequence by using 
the program cross match (Phil Green, http:\\chimera.biotech.washington.edu\UWGC). 
Since the shotgun libraries construction detailed above leaves some BAC vector in the 
shotgun libraries, this program was used to compare the sequence of the BAC contigs 
to the BAC vector and to mask any vector sequence prior to subsequent steps. 
Masked sequences were marked by an "X" in the sequence files, and remained inert 
during subsequent analyses. 

3. E. coli sequences contaminating the BAC sequences were masked by 
comparing the BAC contigs to the entire E. coli DNA sequence. 

4. Repetitive elements known to be common in the human genome were 
masked using cross match. In this implementation of crossmatch, the BAC sequence 
was compared to a database of human repetitive elements (Jerzy Jerka, Genetic 
Information Research Institute, Palo Alto, CA). The masked repeats were marked by 
X and remained inert during subsequent analyses. 

5. The location of exons within the sequence was predicted using the 
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MZEF computer program (Zhang, 1997 Proc. Natl. Acad. Sci USA. 94: 565-8). 

6. The sequence was compared to the publicly available unigene database 
(National Center for Biotechnology Information, National Library of Medicine, 3 8 A, 
8N905, 8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using 
5 the*blastn2 algorithm (Altschul et al, 1997 Nucl Acids Res. 25: 3389-3402). The 

parameters for this search were: E=0.05, v=50, B=50 (where E is the expected 
probability score cutoff, V is the number of database entries returned in the reporting 
of the results, and B is the number of sequence alignments returned in the reporting of 
the results (Altschul et aL, J. Mol Biol, 215:403-410 (1990)). 

10 7. The sequence was translated into protein for all six reading frames, and 

the protein sequences were compared to a non-redundant protein database compiled 
from Genpept Swissprot PIR (National Center for Biotechnology Information, 
National Library of Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 
20894; www.ncbi.nlm.nih.gov). The parameters for this search were E=0.05, V=50, 

15 B= 50, where E, V, and B are defined as above. 

8. The BAC DNA sequence was compared to the database of the cDNA 
clones derived from direct selection experiments (described below) using blastn2 
(Altschul et al y 1997). The parameters for this search were E=0.05, V=250, B=250, 
where E, V, and B are defined as above. 

20 9. The BAC sequence was compared to the sequences of all other BACs 

from the HBM region on chromosome 1 lql2-13 using blastn2 (Altschul et aL 9 1997). 
The parameters for this search were E=0.05, V=50, B=50, where E, V, and B are 
defined as above. 

10. The BAC sequence was compared to the sequences derived from the 
25 ends of BACs from the HBM region on chromosome 1 lql2-13 using blastn2 

(Altschul et al, 1997). The parameters for this search were E=0.05, V=50, B=50, 
where E, V, and B are defined as above. 

1 1 . The BAC sequence was compared to the GenBank database (National 
Center for Biotechnology Information, National Library of Medicine, 3 8 A, 8N905, 

30 8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using blastn2 
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(Altschul et al, 1997). The parameters for this search were E=0.05, V=50, B=50, 
where E, V, and B are defined as above. 

12. The B AC sequence was compared to the STS division of GenBanlc 
database (National Center for Biotechnology Information, National Library of 

5 Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; 

www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al, 1997). The parameters for this 
search were E=0.05, V=50, B= 50, where E, V, and B are denned as above. 

13. The BAC sequence was compared to the Expressed Sequence (EST) 
Tag GenBank database (National Center for Biotechnology Information, National 

10 Library of Medicine, 38A 8N905, 8600 Rockville Pike, Bethesda, MD 20894; 

www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al, 1997). The parameters for this 
search were E=0.05, V=250, B=250, where E, V, and B are defined as above. 

11. Gene Identification hy Direct pDNA Selection 
1 5 Primary linkered cDNA pools were prepared from bone marrow, calvarial bone, 

femoral bone, kidney, skeletal muscle, testis and total brain. Poly (A) + RNA was prepared 
from calvarial and femoral bone tissue (Chomc2ynski et al, 1987 Anal Biochem. 162: 156-9; 
and DAlessio et al, 1987 Focus 9: 1-4) and the remainder of the mRNA was purchased from 
Clontech (Palo Alto, California). In order to generate oligo(dT) and random primed cDNA 

20 pools from the same tissue, 2.5 ug mRNA was mixed with oligo(dT) primer in one reaction 
and 2.5 ug mRNA was mixed with random hexamers in another reaction, and both were 
converted to first and second strand cDNA according to manufacturers recommendations 
(Life Technologies, Bethesda, MD). Paired phosphorylated cDNA linkers (see sequence 
below) were annealed together by mixing in a 1:1 ratio (10 ug each) incubated at 65 °C for 

25 five minutes and allowed to cool to room temperature. 
Paired linkers oligo l A 

OLIGO 1 : 5*-CTG AGC GGA ATT CGT GAG ACC-3* (SEQ ID NO:12) 
OLIGO 2: 5'-TTG GTC TCA CGT ATT CCG CTC GA-3' (SEQ ID NO: 1 3) 
Paired linkers oligo3/4 

30 OLIGO 3 : 5'-CTC GAG AAT TCT GGA TCC TC-3' (SEQ ID NO: 14) 
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OLIGO 4: 5'-TTG AGG ATC CAG AAT TCT CGA G-3' (SEQ ID NO:15) 
Paired linkers oligo5/6 

OLIGO 5: 5'-TGT ATG CGA ATT CGC TGC GCG-3' (SEQ ID NO:16) 

OLIGO 6: 5'-TTC GCG CAG CGA ATT CGC ATA CA-3' (SEQ ID NO: 17) 
5 Paired linkers oligo7/8 

OLIGO 7: 5'-GTC CAC TGA ATT CTC AGT GAG-3' (SEQ ID NO:18) 

OLIGO 8: 5'-TTG TCA CTG AGA ATT CAG TGG AC-3 f (SEQ ID NO:19) 
Paired linkers oligo 11/12 

OLIGO 11: 5'-GAA TCC GAA TTC CTG GTC AGC-3' (SEQ ID NO:20) 
10 OLIGO 12: 5'-TTG CTG ACC AGG AAT TCG GAT TC-3 ? (SEQ ID NO:21) 

Linkers were ligated to all oligo(dT) and random primed cDNA pools (see below) 
according to manufacturers instructions (Life Technologies, Bethesda, MD). 

Oligo Vi was ligated to oligo(dT) and random primed cDNA pools prepared from 
bone marrow. Oligo 3/4 was ligated to oligo(dT) and random primed cDNA pools prepared 
1 5 from calvarial bone. Oligo 5/6 was ligated to oligo(dT) and random primed cDNA pools 

prepared from brain and skeletal muscle. Oligo 7/8 was ligated to oligo(dT) and random * 
primed cDNA pools prepared from kidney. Oligo 11/12 was ligated to oligo(dT) and random 
primed cDNA pools prepared from femoral bone. 

The cDNA pools were evaluated for length distribution by PCR amplification using 1 
20 )j.l of a 1 : 1 , 1 : 1 0, and 1 : 1 00 dilution of the ligation reaction, respectively. PCR reactions 

were performed in a Perkin Elmer 9600, each 25 jil volume reaction contained 1 fil of DNA, 
10 mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 mM MgCl 2 , 0.001% gelatin, 200 mM each 
dNTPs, 10 fiM primer and 1 unit Taq DNA polymerase (Perkin Elmer) and was amplified 
under the following conditions: 30 seconds at 94°C, 30 seconds at 60°C and 2 minutes at 
25 72 °C for 30 cycles. The length distribution of the amplified cDNA pools were evaluated by 
electrophoresis on a 1% agarose gel. The PCR reaction that gave the best representation of 
the random primed and oligo(dT) primed cDNA pools was scaled up so that -2-3 (ig of each 
cDNA pool was produced. The starting cDNA for the direct selection reaction comprised of 
0.5 |ig of random primed cDNAs mixed with 0.5 \ig of oligo(dT) primed cDNAs. 
30 The DNA from the 54 BACs that were used in the direct cDNA selection procedure 
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was isolated using Nucleobond AX columns as described by the manufacturer (The Nest 
Group, Inc.). 

The BACs were pooled in equimolar amounts and 1 ug of the isolated genomic DNA 
was labeled with biotin 16-UTP by nick translation in accordance with the manufacturers 
instructions (Boehringer Mannheim). The incorporation of the biotin was monitored by 
methods that could be practiced by one skilled in the art (Del Mastro et al, Methods in 
Molecular Biology, Humana Press Inc., NJ (1996)). 

Direct cDNA selection was performed using methods that could be practiced by one 
skilled in the art (Del Mastro et al, 1996). Briefly, the cDNA pools were multiplexed in two 
separate reactions: In one reaction cDNA pools from bone marrow, calvarial bone, brain and 
testis were mixed, and in the other cDNA pools from skeletal muscle, kidney and femoral 
bone were mixed. Suppression of the repeats, yeast sequences and plasmid in the cDNA 
pools was performed to a Cot of 20. 1 00 ng of biotinylated BAC DNA was mixed with the 
suppressed cDNAs and hybridized in solution to a Cot of 200. The biotinylated DNA and the 
cognate cDNAs was captured on streptavidin-coated paramagnetic beads. The beads were 
washed and the primary selected cDNAs were eluted. These cDNAs were PCR amplified 
and a second round of direct selection was performed. The product of the second round of 
direct selection is referred to as the secondary selected material. A Galanin cDNA clone, 
previously shown to map to 1 lql2-13 (Evans, 1993 Genomics 18: 473-7), was used to 
monitor enrichment during the two rounds of selection. 

The secondary selected material from bone marrow, calvarial bone, femoral bone, 
kidney, skeletal muscle, testis and total brain was PCR amplified using modified primers of 
oligos 1, 3, 5, 7 and 11, shown below, and cloned into the UDG vector pAMPIO (Life 
Technologies, Bethesda, MD), in accordance with the manufacturer's recommendations. 

Modified primer sequences: 

Primer SEQ Sequence 
ID NO . 

Oligo 1 22 5--CUA CUA CUA CUA CTG AGC GGA ATT CGT GAG ACC-3' 

Ofigo 3 23 5'-CUA CUA CUA CUA CTC GAG AAT TCT GGA TCC TC-3' 

Oligo 5 24 5'-CUA CUA CUA CUA TGT ATG CGA ATT CGC TGC GCG-3' 
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Primer SEQ Sequence 
TP NO , 

Oiigo 7 25 5'-CUA CUA CUA CUA GTC CAC TGA ATT CTC AGT GAG-3' 

Oligo 1 1 26 5'-CUA CUA CUA CUA GAA TCC GAA TTC CTG GTC AGC-3 ' 

The cloned secondary selected material, from each tissue source, was transformed into 
5 MAX Efficiency DH5a Competent Cells (Life Technologies, Bethesda, MD) as 

recommended by the manufacturer. 384 colonies were picked from each transformed source 
and arrayed into four 96 well microtiter plates. 

All secondarily selected cDNA clones were sequenced using Ml 3 dye primer 
terminator cycle sequencing kit (Applied Biosystems), and the data collected by the ABI 377 

10 automated fluorescence sequencer (Applied Biosystems). 

All sequences were analyzed using the BLASTN, BLASTX and FASTA programs 
(Altschul et aU 1990 J. Mol Biol 215: 403-410; and Altschul et al 9 Nucl Acids, Res. 25: 
3389-3402). The cDNA sequences were compared to a database containing sequences 
derived from human repeats, mitochondrial DNA, ribosomal RNA, E. coli DNA to remove- 

15 background clones from the dataset using the program cross_match. A further round of 
comparison was also performed using the program BLASTN2 against known genes 
(GenBank) and the BAC sequences from the HBM region. Those cDNAs that were >90% 
homologous to these sequences were filed according to the result and the data stored in a 
database for further analysis. cDNA sequences that were identified but did not have 

20 significant similarity to the BAC sequences from the HBM region or were eliminated by 

cross_match were hybridized to nylon membranes which contained the B ACs from the HBM 
region, to ascertain whether they hybridized to the target. 

Hybridization analysis was used to map the cDNA clones to the BAC target that 
selected them. The BACs that were identified from the HBM region were arrayed and grown 

25 into a 96 well microtiter plate. LB agar containing 25 fig/ml kanamycin was poured into 96 
well microtiter plate lids. Once the agar had solidified, pre-cut Hybond N+ nylon membranes 
(Amersham) were laid on top of the agar and the BACs were stamped onto the membranes in 
duplicate using a hand held 96 well replica plater (V&P Scientific, Inc.). The plates were 
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incubated overnight at 37 °C. The membranes were processed according to the manufacturers 
recommendations. 

The cDNAs that needed to be mapped by hybridization were PCR amplified using the 
relevant primer (oligos 1, 3, 5, 7 and 1 1) that would amplify that clone. For this PCR 
amplification, the primers were modified to contain a linkered digoxigenin molecule at the 5* 
of the oligonucleotide. The PCR amplification was performed under the same conditions as 
described in Preparation of cDNA Pools (above). The PCR products were evaluated for 
quality and quantity by electrophoresis on a 1% agarose gel by loading 5 jxl of the PCR 
reaction. The nylon membranes containing the stamped BACs were individually pre- 
hybridized in 50 ml conical tubes containing 10 ml of hybridization solution (5X SSPE, 0.5X 
Blotto, 2.5% SDS and 1 mM EDTA (pH 8.0)). The 50 ml conical tubes were placed in a 
rotisserie oven (Robbins Scientific) for 2 hours at 65 °C. Twenty-five ng of each cDNA 
probe was denatured and added into individual 50 ml conical tubes containing the nylon 
membrane and hybridization solution. The hybridization was performed overnight at 65 °C. 
The filters were washed for 20 minutes at 65°C in each of the following solutions: 3X SSPE, 
0.1% SDS; IX SSPE, 0.1% SDS and 0.1X SSPE, 0.1% SDS. 

The membranes were removed from the 50 ml conical tubes and placed in a dish. 
Acetate sheets were placed between each membrane to prevent them from sticking to each 
other. The incubation of the membranes with the Anti-DIG-AP and CDP-Star was performed 
according to manufacturers recommendations (Boehringer Mannheim). The membranes were 
wrapped in Saran wrap and exposed to Kodak Bio-Max X-ray film for 1 hour. 

12. cDNA Cloning and ttxpressinn Analysis 

To characterize the expression of the genes identified by direct cDNA selection and 
genomic DNA sequencing in comparison to the publicly available databases, a series of 
experiments were performed to further characterize the genes in the HBM region. First, 
oligonucleotide primers were designed for use in the polymerase chain reaction (PCR) so that 
portions of a cDNA, EST, or genomic DNA could be amplified from a pool of DNA 
molecules (a cDNA library) or RNA population (RT-PCR and RACE). The PCR primers 
were used in a reaction containing genomic DNA to verify that they generated a product of 
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the size predicted based on the genomic (BAC) sequence. A number of cDNA libraries were 
then examined for the presence of the specific cDNA or EST. The presence of a fragment of 
a transcription unit in a particular cDNA library indicates a high probability that additional 
portions of the same transcription unit will be present as well. 
5 A critical piece of data that is required when characterizing novel genes is the length, 

in nucleotides, of the processed transcript or messenger RNA (mRNA). One skilled in the art 
primarily determines the length of an mRNA by Northern blot hybridization (Sambrook et 
ah, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor NY (1989)). Groups of ESTs and direct-selected cDNA clones that displayed 

10 significant sequence similarity to sequenced BACs in the critical region were grouped for 
convenience into approximately 30 kilobase units. Within each 30 kilobase unit there were 
from one up to fifty ESTs and direct-selected cDNA clones which comprised one or more 
independent transcription units. One or more ESTs or direct-selected cDNAs were used as 
hybridization probes to determine the length of the mRNA in a variety of tissues, using 

15 commercially available reagents (Multiple Tissue Northern blot; Clontech, Palo Alto, 
California) under conditions recommended by the manufacturer. 

Directionally cloned cDNA libraries from femoral bone, and calvarial bone tissue ; 
were constructed by methods familiar to one skilled in the art (for example, Soares in 
Automated DNA Sequencing and Analysis, Adams, Fields and Venter, Eds., Academic Press, 

20 NY, pages 110-114 (1994)). Bones were initially broken into fragments with a hammer, and 
the small pieces were frozen in liquid nitrogen and reduced to a powder in a tissue pulverizer 
(Spectrum Laboratory Products). RNA was extracted from the powdered bone by 
homogenizing the powdered bone with a standard Acid Guanidinium 
Ttaocyanate-Phenol-Chlorofonn extraction buffer (e.g., Chomczynski et aL, 1987 Anal 

25 Biochem. 162: 156-9) using a polytron homogenizer (Brinkman Instruments). Additionally, 
human brain and lung total RNA was purchased from Clontech. PolyA RNA was isolated 
from total RNA using dynabeads-dT according to the manufacturer's recommendations 
(Dynal, Inc.). First strand cDNA synthesis was initiated using an oligonucleotide primer with 
the sequence: 5'- A ACTGGAAG AATTC GCGGCCGC A GO A ATTTTTTTTTTTTTTTT 

30 TT-3' (SEQ ID NO:27). This primer introduces a Notl restriction site (underlined) at the 3\ 
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end of the cDNA. First and second strand synthesis were performed using the "one-tube 11 
cDNA synthesis kit as described by the manufacturer (Life Technologies, Bethesda, MD). 
Double stranded cDNAs were treated with T4 polynucleotide kinase to ensure that the ends 
of the molecules were blunt (Soares in Automated DNA Sequencing and Analysis, Adams, 
5 Fields and Venter, Eds., Academic Press, NY, pages 110-1 14 (1994)), and the blunt ended 
cDNAs were then size selected by a Biogel column (Huynh et al in DNA Cloning, Vol. 1, 
Glover, Ed., HtL Press, Oxford, pages 49-78 (1985)) or with a size-sep 400 Sepharose 
column (Pharmacia, catalog # 27-5105-01). Only cDNAs of 400 base pairs or longer were 
used in subsequent steps. EcoRI adapters (sequence: 5 ! OH-AATTCGGCACGAG-OH 3 f 

10 (SEQ ID NO:28), and 5' p-CTCGTGCCG-OH 3' (SEQ ID NO:29)) were then ligated to the 
double stranded cDNAs by methods familiar to one skilled in the art (Soares, 1994). The 
EcoRI adapters were then removed from the 3 f end of the cDNA by digestion with Notl 
(Soares, 1994). The cDNA was then ligated into the plasmid vector pBluescript® H KS+ 
(Stratagene, La Jolla, California), and the ligated material was transformed into E. coli host 

15 DH10B or DH12S by electroporation methods familiar to one skilled in the art (Soares, 

1994). After growth overnight at 37 °C, DNA was recovered from the E. coli colonies after 
scraping the plates by processing as directed for the Mega-prep kit (Qiagen, Chatswbrth, 
California). The quality of the cDNA libraries was estimated by counting a portion of the 
total numbers of primary transformants and determining the average insert size and the 

20 percentage of plasmids with no cDNA insert. Additional cDNA libraries (human total brain, 
heart, kidney, leukocyte, and fetal brain) were purchased from Life Technologies, Bethesda, 
MD. 

cDNA libraries, both oligo (dT) and random hexamer (N 6 ) primed, were used for 
isolating cDNA clones transcribed within the HBM region: human bone, human brain, human 

25 kidney and human skeletal muscle (all cDNA libraries were made by the inventors, except for 
skeletal muscle (dT) and kidney (dT) cDNA libraries). Four 10x10 arrays of each of the 
cDNA libraries were prepared as follows: the cDNA libraries were titered to 2.5 x 10 6 using 
primary transformants. The appropriate volume of frozen stock was used to inoculate 2 L of 
LB/ampicillin (100 mg/ml). This inoculated liquid culture was aliquoted into 400 tubes of 4 

30 ml each. Each tube contained approximately 5000 cfu. The tubes were incubated at 30°C 
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overnight with gentle agitation. The cultures were grown to an OD of 0.7-0.9. Frozen stocks 
were prepared for each of the cultures by allocating 100 (il of culture and 300 of 80% 
glycerol. Stocks were frozen in a dry ice/ethanol bath and stored at -70°C. The remaining 
culture was DNA prepared using the Qiagen (Chatsworth, CA) spin miniprep kit according to 
5 the manufacturer's instructions. The DNAs from the 400 cultures were pooled to make 80 
column and row pools. The cDNA libraries were determined to contain HBM cDNA clones 
of interest by PCR. Markers were designed to amplify putative exons. Once a standard PCR 
optimization was performed and specific cDNA libraries were determined to contain cDNA 
clones of interest, the markers were used to screen the arrayed library. Positive addresses 
1 0 indicating the presence of cDNA clones were confirmed by a second PCR using the same 
markers. 

Once a cDNA library was identified as likely to contain cDNA clones corresponding 
to a specific transcript of interest from the HBM region, it was manipulated to isolate the 
clone or clones containing cDNA inserts identical to the EST or direct-selected cDNA of 

15 interest. This was accomplished by a modification of the standard "colony screening" 

method (Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor ; 
Laboratory, Cold Spring Harbor NY (1989)). Specifically, twenty 150 mm LBH-ampicillin 
agar plates were spread with 20,000 colony forming units (cfii) of cDNA library and the 
colonies allowed to grow overnight at 37 °C. Colonies were transferred to nylon filters 

20 (Hybond from Amersham, or equivalent) and duplicates prepared by pressing two filters 

together essentially as described (Sambrook et al. 9 Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor NY (1989)). The "master" plate was 
then incubated an additional 6-8 hours to allow the colonies to grow back. The DNA from 
the bacterial colonies was then affixed to the nylon filters by treating the filters sequentially 

25 with denaturing solution (0.5 N NaOH, 1.5 M NaCl) for two minutes, neutralization solution 
(0.5 M Tris-Cl pH 8.0, 1.5 M NaCl) for two minutes (twice). The bacterial colonies were 
removed from the filters by washing in a solution of 2X SSC/0.1% SDS for one minute while 
rubbing with tissue paper. The filters were air dried and baked under vacuum at 80 °C for 1-2 
hours. 

30 A cDNA hybridization probe was prepared by random hexamer labeling (Fineberg 
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and Vogelstein, Anal Biochem., 132:6-13 (1983)) or by including gene-specific primers and 
no random hexamers in the reaction (for small fragments). Specific activity was calculated 
and was >5xl0 8 cpm/10 8 |Lig of cDNA. The colony membranes were then prewashed in 10 
mM Tris-Cl pH 8.0, 1 M NaCl, 1 mM EDTA, 0.1% SDS for 30 minutes at 55 °C. Following 
5 the prewash, the filters were prehybridized in > 2 ml/filter of 6X SSC, 50 % deionized 

formamide, 2% SDS, 5X Denhardt's solution, and 100 mg/ml denatured salmon sperm DNA, 
at 42 °C for 30 minutes. The filters were then transferred to hybridization solution (6X SSC, 
2% SDS, 5X Denhardt's, 100 mg/ml denatured salmon sperm DNA) containing denatured 
a 32 P-dCTP-labeled cDNA probe and incubated at 42°C for 16-18 hours. 
10 After the 16-18 hour incubation, the filters were washed under constant agitation in 

2X SSC, 2% SDS at room temperature for 20 minutes, followed by two washes at 65 °C for 
15 minutes each. A second wash was performed in 0.5 X SSC, 0.5% SDS for 15 minutes at 
65 °C. Filters were then wrapped in plastic wrap and exposed to radiographic film for several 
hours to overnight. After film development, individual colonies on plates were aligned with 
15 the autoradiograph so that they could be picked into a 1 ml solution of LB Broth containing 
ampicillin. After shaking at 37°C for 1-2 hours, aliquots of the solution were plated on 150 
mm plates for secondary screening. Secondary screening was identical to primary screening 
(above) except that it was performed on plates containing -250 colonies so that individual 
colonies could be clearly identified for picking. 
20 After colony screening with radiolabeled probes yielded cDNA clones, the clones 

were characterized by restriction endonuclease cleavage, PCR, and direct sequencing to 
confirm the sequence identity between the original probe and the isolated clone. To obtain 
the full-length cDNA, the novel sequence from the end of the clone identified was used to 
probe the library again. This process was repeated until the length of the cDNA cloned 
25 matches that estimated to be full-length by the northern blot analysis. 

RT-PCR was used as another method to isolate full length clones. The cDNA was 
synthesized and amplified using a "Superscript One Step RT-PCR" kit (Life Technologies, 
Gaithersburg, MD). The procedure involved adding 1.5 \ig of RNA to the following: 25 jxl of 
reaction mix provided which is a proprietary buffer mix with MgS0 4 and dNTP's> 1 \i\ sense 
30 primer (10 \iM) and 1 fil anti-sense primer (10 nM), 1 jil reverse transcriptase and Taq DNA 
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polymerase mix provided and autoclaved water to a total reaction mix of 50 |il. The reaction 
was then placed in a thermocycler for 1 cycle at 50°C for 15 to 30 minutes, then 94°C for 15 
seconds, 55-60°C for 30 seconds and 68-72°C for 1 minute per kilobase of anticipated 
product and finally 1 cycle of 72°C for 5-10 minutes. The sample was analyzed on an 
5 agarose gel. The product was excised from the gel and purified from the gel (GeneClean, Bio 
101). The purified product was cloned in pCTNR (General Contractor DNA Cloning 
System, 5 Prime - 3 Prime, Inc.) and sequenced to verify that the clone was specific to the 
gene of interest. 

Rapid Amplification of cDNA ends (RACE) was performed following the 

10 manufacturer's instructions using a Marathon cDNA Amplification Kit (Clontech, Palo Alto, 
CA) as a method for cloning the 5' and 3 ! ends of candidate genes. cDNA pools were 
prepared from total RNA by performing first strand synthesis, where a sample of total RNA 
sample was mixed with a modified oligo (dT) primer, heated to 70 °C, cooled on ice and 
followed by the addition of: 5X first strand buffer, 10 mM dNTP mix, and AMV Reverse 

15 Transcriptase (20 U/fil). The tube was incubated at 42°C for one hour and then the reaction 
tube was placed on ice. For second strand synthesis, the following components were added ; 
directly to the reaction tube: 5x second strand buffer, 10 mM dNTP mix, sterile water, 20X 
second strand enzyme cocktail and the reaction tube was incubated at 16 °C for 1.5 hours. T4 
DNA Polymerase was added to the reaction tube and incubated at 16 °C for 45 minutes. The 

20 second-strand synthesis was terminated with the addition of an EDTA/Glycogen mix. The 
sample was subjected to a phenol/chloroform extraction and an ammonium acetate 
precipitation. The cDNA pools were checked for quality by analyzing on an agarose gel for 
size distribution. Marathon cDNA adapters (Clontech) were then ligated onto the cDNA 
ends. The specific adapters contained priming sites that allowed for amplification of either 5' 

25 or 3' ends, depending on the orientation of the gene specific primer (GSP) that was chosen. 
An aliquot of the double stranded cDNA was added to the following reagents: 10 jo.M 
Marathon cDNA adapter, 5x DNA ligation buffer, T4 DNA ligase. The reaction was 
incubated at 16°C overnight. The reaction was heat inactivated to terminate the reaction. 
PCR was performed by the addition of the following to the diluted double stranded cDNA 

30 pool: 10X cDNA PCR reaction buffer, 10 p.M dNTP mix, 10 jiM GSP, 10 pM API primer 
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(kit), 50x Advantage cDNA Polymerase Mix. Thermal Cycling conditions were 94 °C for 30 
seconds, 5 cycles of 94°C for 5 seconds, 72°C for 4 minutes, 5 cycles of 94°C for 5 seconds, 
70 °C for 4 minutes, 23 cycles of 94 °C for 5 seconds, 68 °C for 4 minutes. After the first 
round of PCR was performed using the GSP to extend to the end of the adapter to create the 
adapter primer binding site, exponential amplification of the specific cDNA of interest was 
observed. Usually a second nested PCR is performed to confirm the specific cDNA. The 
RACE product was analyzed on an agarose gel and then excised and purified from the gel 
(GeneClean, BIO 101). The RACE product was then cloned into pCTNR (General 
Contractor DNA Cloning System, 5' - 3», Inc.) and the DNA sequence determined to verify 
that the clone is specific to the gene of interest. 

13. Mutation Analysis 

Comparative genes were identified using the above procedures and the exons from 
each gene were subjected to mutation detection analysis. Comparative DNA sequencing was 
used to identify polymorphisms in HBM candidate genes from chromosome 1 lql2-13. DNA 
sequences for candidate genes were amplified from patient lymphoblastoid cell lines. 

The inventors developed a method based on analysis of direct DNA sequencing of 
PCR products amplified from candidate regions to search for the causative polymorphism. 
The procedure consisted of three stages that used different subsets of HBM family to find 
segregating polymorphisms and a population panel to assess the frequency of the 
polymorphisms. The family resources result from a single founder leading to the assumption 
that all affected individuals will share the same causative polymorphism. 

Candidate regions were first screened in a subset of the HBM family consisting of the 
proband, daughter, and her mother, father and brother. Monochromosomal reference 
sequences were produced concurrently and used for comparison. The mother and daughter 
carried the HBM polymorphism in this nuclear family, providing the ability to monitor 
polymorphism transmission. The net result is that two HBM chromosomes and six non- 
HBM chromosomes were screened. This allowed exclusion of numerous frequent alleles. 
Only alleles exclusively present in the affected individuals passed to the next level of 
analysis. 
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Polymorphisms that segregated exclusively with the HBM phenotype in this original 
family were then re-examined in an extended portion of the HBM pedigree consisting of two 
additional nuclear families. These families consisted of five HBM and three unaffected 
individuals. The HBM individuals in this group included the two critical crossover 
5 individuals, providing the centromeric and telomeric boundaries of the critical region. 

Tracking the heredity of polymorphisms between these individuals and their affected parents 
allowed for further refining of the critical region. This group brought the total of HBM 
chromosomes screened to seven and the total of non-HBM chromosomes to seventeen. 
When a given polymorphism continued to segregate exclusively with the HBM 

10 phenotype in the extended group, a population panel was then examined. This panel of 84 
persons consisted of 42 individuals known to have normal bone mineral density and 42 
individuals known to be unrelated but with untyped bone mineral density. For this purpose, 
normal bone mineral density is within two standard deviations of a BMD Z score of 0. The 
second group was from the widely used CEPH panel of individuals. Any segregating 

15 polymorphisms found to be rare in this population were subsequently examined on the entire 
HBM pedigree and a larger population. i 

Polymerase chain reaction (PCR) was used to generate sequencing templates from the 
HBM family's DNA and monochromosomal controls. Enzymatic amplification of genes 
within the HBM region on 1 lq 12- 13 was accomplished using the PCR with oligonucleotides 

20 flanking each exon as well as the putative 5' regulatory elements of each gene. The primers 
were chosen to amplify each exon as well as 15 or more base pairs within each intron on 
either side of the splice. All PCR primers were made as chimeras to facilitate dye primer 
sequencing. The M13-21F (5 ! - GTA A CGA CGG CCA GT -3») (SEQ ID NO:30) and - 
28REV (5'- AAC AGC TAT GAC CAT G -3') (SEQ ID NO:31) primer binding sites were 

25 built on to the 5 1 end of each forward and reverse PCR primer, respectively, during synthesis. 
1 50 ng of genomic DNA was used in a 50 jil PCR with 2 U AmpliTaq, 500 nM primer and 
125 \xM dNTP. Buffer and cycling conditions were specific to each primer set. TaqStart 
antibody (Clontech) was used for hot start PCR to minimize primer dimer formation. 10% of 
the product was examined on an agarose gel. The appropriate samples were diluted 1:25 with 

30 deionized water before sequencing. 
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Each PCR product was sequenced according to the standard Energy Transfer primer 
(Amersham) protocol. All reactions took place in 96 well trays. 4 separate reactions, one 
each for A, C, G and T were performed for each template. Each reaction included 2 jal of the 
sequencing reaction mix and 3 jutl of diluted template. The plates were then heat sealed with 
5 foil tape and placed in a thermal cycler and cycled according to the manufacturer's 

recommendation. After cycling, the 4 reactions were pooled. 3 p.1 of the pooled product was 
transferred to a new 96 well plate and 1 jxl of the manufacturer's loading dye was added to 
each well. All 96 well pipetting procedures occurred on a Hydra 96 pipetting station 
(Robbins Scientific, USA). 1 pi of pooled material was directly loaded onto a 48 lane gel 

10 running on an ABI 377 DNA sequencer for a 10 hour, 2.4 kV run. 

Polyphred (University of Washington) was used to assemble sequence sets for 
viewing with Consed (University of Washington). Sequences were assembled in groups 
representing all relevant family members and controls for a specified target region. This was 
done separately for each of the three stages. Forward and reverse reads were included for 

15 each individual along with reads from the monochromosomal templates and a color annotated 
reference sequence. Polyphred indicated potential polymorphic sites with a purple flag. Two 
readers independently viewed each assembly and assessed the validity of the purple-flagged 
sites. 

A total of 23 exons present in the mature nxRNA and several other portions of the 
20 primary transcript were evaluated for heterozygosity in the nuclear family of two HBM- 
affected and two unaffected individuals. 25 SNPs were identified, as shown in the table 
below. 

TABLE 8 

Single Nucleotide Polymorphisms in the Zmaxl (LRPS) Gene and Environs 

25 

Exon Name Location Base Change 

b200e21-h_Contigl_l.nt 69169 (309G) C/A 

30 b200e21-h_Contig4_12.nt 27402 (309G) A/G 

b200e21-h_Contig4__13.nt 27841 (309G) T/C 

b200e21-h_Contig4_16.nt 35600 (309G) A/G 



35 



b200e21-h_Contig4_21.nt 45619 (309G) G/A 
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Exon Name 


Location 


Base Change 




b200e21-h_Contig4_22.nt-a 


46018 (309G) 


T/G 


5 


b200e2 1 -h_Contig4_22.nt-b 


46093 (309G) 


T/G 




b200e21-h_Contig4_22.nt-c 


46190 (309G) 


A/G 




b200e2 1 -h_Contig4_24.nt-a 


50993 (309G) 


T/C 


10 


b200e2 1 -h_Contig4_24.nt-b 


51124 (309G) 


C/T 




b200e2 1 -h_Contig4_25 .nt 


55461 (309G) 


C/T 


J. 


b200e2 1 -h_Contig4_33 .nt-a 


63645 (309G) 


CIA 


b200e2 1 -h_Contig4_33 .nt-b 


63646 (309G) 


AJC 




b200e21-h_Contig4_61 .nt 


24809 (309G) 


T/G 


20 


b200e2 1 -h_Contig4_62.nt 


27837 (309G) 


T/C 




b200e2 1 -h_Contig4_63 .nt-a 


31485 (309G) 


C/T 


25 


b200e2 1 -h_Contig4_63 .nt-b 


31683 (309G) 


A/G 


b200e2 1 -h_Contig4_9.nt 


24808 (309G) 


T/G 




b527dl2-h_Contig030g_l .nt-a 


31340 (308G) 


T/C 


30 


b527dl2-h_Contig030g_l .nt-b 


32538 (308G) 


A/G 




b527dl2-h_Contig080C_2.nt 


13224 (308G) 


A/G 


35 


b527dl2-h_Contig087C_l .nt 


21119 (308G) 


C/A 


b527dl2-h_Contig087C_4.nt 


30497 (308G) 


G/A 




b527dl2-h_Contig088C_4.nt 


24811 (309G) 


A/C 


40 


b527dl2-h_Contig089_lHP.nt 


68280 (309G) 


G/A 



In addition to the polymorphisms presented in Table 8, two additional polymorphisms can 
also be present in SEQ ID NO:2. These is a change at position 2002 of SEQ ID NO:2. Either 
a guanine or an adenine can appear at this position. This polymorphism is silent and is not 
45 associated with any change in the amino acid sequence. The second change is at position 
4059 of SEQ ID NO:2 corresponding in a cytosine (C) to thymine (T) change. This 
polymorphism results in a corresponding amino acid change from a valine (V) to an alanine 
(A). Other polymorphisms were found in the candidate gene exons and adjacent intron 
sequences as displayed in Tables 2 and 3. Any one or combination of the polymorphisms 
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listed in Tables 2, 3 or 8 or the two discussed above could also have a minor effect on bone 
mass when present in SEQ ID NO:2. These could also be in combination with any of the 
other mutations discussed in Section 3 and in the Examples below. 

The present invention encompasses the nucleic acid sequences having the nucleic acid 
5 sequence of SEQ ED NO: 1 with any one or more of the above-identified point mutations. 

Preferably, the present invention encompasses the nucleic acid of SEQ ID NO: 2 or is 
a mutation in SEQ ID NO: 1 that produces a phenotype like that observed with the protein 
encoded by SEQ ID NO:2 (HBM phenotype). Specifically, a base-pair substitution changing 
G to T at position 582 in the coding sequence of Zmaxl (the HBM gene) was identified as 

10 heterozygous in all HBM individuals, and not found in the unaffected individuals (i.e., 
b527dl2-h_Contig087C_l.nt). Fig. 6 shows the order of the contigs in B527D12. The 
direction of transcription for the HBM gene is from left to right. The sequence of contig308G 
of B527D12 is the reverse complement of the coding region to the iZffMgene. Therefore, the 
relative polymorphism in contig 308G shown in Table 8 as a base change substitution of C to 

15 A is the complement to the G to T substitution in the HBM gene. This mutation causes a 
substitution of glycine 171 with valine (G171 V). 

The HBM polymorphism was confirmed by examining the DNA sequence of different 
groups of individuals. In all members of the HBM pedigree (38 individuals), the HBM 
polymorphism was observed in the heterozygous form in affected (i.e., elevated bone mass) 

20 individuals only (N=l 8). In unaffected relatives (N=20) (BMDZ<2.0), the HBM 

polymorphism was never observed. To determine whether this polymorphism was ever 
observed in individuals outside of the HBM pedigree, 297 phenotyped individuals were 
characterized at the site of the HBM gene. None were heterozygous at the site of the HBM 
polymorphism. In an unphenotyped control group, none of 64 individuals were observed to 

25 be heterozygous at position 582. Taken together, these data prove that the polymorphism 

observed in the kindred displaying the high bone mass phenotype is strongly correlated with 
the G-T polymorphism at position 582 of Zmaxl (LRP5). Furthermore, these results 
coupled with the ASO results described below, establish that the HBM polymorphism 
genetically segregates with the HBM phenotype, and that both the HBM polymorphism and 
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phenotype are rare in the general population. 

13. Allele Specific Oligonucleotide (ASO) Analysis 

The amplicon containing the HBM polymorphism was PCR amplified using primers 
specific for the exon of interest. The appropriate population of individuals was PCR 
amplified in 96 well microtiter plates as follows. PCR reactions (20 jxl) containing IX 
Promega PCR buffer (Cat. # Ml 883 containing 1.5 mM MgCl 2 ), 100 mM dNTP, 200 nM 
PCR primers (1863F: S'-CCAAGTTCTGAGAAGTCC-S' and 1864R: 5'- 
AAT ACCTGAAACC AT ACCTG-3 1 ; SEQ ID NOS: 629 and 630 respectively), 1 U 
AmpliTaq™, and 20 ng of genomic DNA were prepared and amplified under the following 
PCR conditions: 94°C, 1 minute, (94°C, 30 sec; 58°C, 30 sec; 72°C, 1 min X35 cycles), 
72 °C, 5 min, 4°C, hold. Loading dye was then added and 10 jxl of the products was 
electrophoresed on 1.5% agarose gels containing 1 jxg/ml ethidium bromide at 100-150 V for 
5-10 minutes. Gels were treated 20 minutes in denaturing solution (1.5 M NaCl, 0.5 N 
NaOH), and rinsed briefly with water. Gels were then neutralized in 1 M Tris-HCl, pH 7.5, 
1 .5 M NaCl, for 20 minutes and rinsed with water. Gels were soaked in 10 X SSC for 20 
minutes and blotted onto nylon transfer membrane (Hybond N+- Amersham) in 10X SSC 
overnight. Filters were the rinsed in 6X SSC for 10 minutes and UV crosslinked. 

The allele specific oligonucleotides (ASO) were designed with the polymorphism 
approximately in the middle. Oligonucleotides were phosphate free at the 5' end and were 
purchased from Gibco BRL. Sequences of the oligonucleotides are: 

2326 Zmaxl.ASO.g: 5-AGACTGGGGTGAGACGC-3 1 (SEQ ID NO:631) 

2327 Zmaxl.ASO.t: 5 , -CAGACTGGGlTGAGACGCC-3 , (SEQ ID NO:632) 
The polymorphic nucleotides are underlined. To label the oligos, 1.5 \x\ of 1 |ig/|il 

ASO oligo (2326.Zmaxl.ASO.g or 2327.Zmaxl.ASO.t), 1 1 j-il ddH 2 0, 2 ^1 10X kinase 
forward buffer, 5 yl y 32 P-ATP (6000 Ci/mMole), and 1 jxl T4 polynucleotide kinase (10 U/jxl) 
were mixed, and the reaction incubated at 37°C for 30-60 minutes. Reactions were then 
placed at 95 °C for 2 minutes and 30 ml H 2 0 was added. The probes were purified using a 
G25 microspin column (Pharmacia). 
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Blots were prehybridized in 10 ml 5X SSPE, 5X Denhardt's, 2% SDS, and 100 
jig/ml, denatured, sonicated salmon sperm DNA at 40 °C for 2 hr. The entire reaction mix of 
kinased oligo was then added to 10 ml fresh hybridization buffer (5X SSPE, 5X Denhardt's, 
2% SDS) and hybridized at 40 °C for at least 4 hours to overnight. 
5 All washes done in 5X SSPE, 0. 1 % SDS. The first wash was at 45 °C for 1 5 minutes; 

the solution was then changed and the filters washed 50 °C for 15 minutes. Filters were then 
exposed to Kodak biomax film with 2 intensifying screens at -70°C for 15 minutes to 1 hr. 
If necessary the filters were washed at 55 °C for 15 minutes and exposed to film again. 
Filters were stripped by washing in boiling 0.1X SSC, 0.1% SDS for 10 minutes at least 3 
10 times. 

The two films that best captured the allele specific assay with the 2 ASOs were 
converted into digital images by scanning them into Adobe PhotoShop. These images were 
overlaid against each other in Graphic Converter and then scored and stored in FileMaker Pro 
4.0 (see Fig. 9). 

15 In order to determine the HBM allele frequency in ethnically diverse populations, 672 

random individuals from various ethnic groups were typed by the allele specific 
oligonucleotide (ASO) method. This population included 96 CEPH grandparents (primarily 
Caucasian), 192 Caucasian, 192 African-American, 96 Hispanic, and 96 Asian individuals. 
No evidence was obtained for the presence of the HBM polymorphism in any of these 

20 individuals. Overall, a total of 91 1 individuals were typed either by direct sequencing or 
ASO hybridization; all were homozygous GG at the site of the HBM polymorphism (Fig. 
14). This information illustrates that the HBM allele is rare in various ethnic populations. 

Thus this invention provides a rapid method of identifying individuals with the HBM 
allele. This method could be used in the area of diagnostics and screening of an individual 

25 for susceptibility to osteoporosis or other bone disorder. The assay could also be used to 
identify additional individuals with the HBM allele or the additional polymorphisms 
described herein. 
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14. Cellular Localization of Zmaxl (LRP5) 

14.1 Gene Expression in rat-tibia by non isotopic in situ Hybridization 
In situ hybridization was conducted by Pathology Associates International (PAI), 
Frederick, MD. This study was undertaken to determine the specific cell types that express 
5 the Zmaxl (LRP5) gene in rat bone with particular emphasis on areas of bone growth and 
remodeling. Zmaxl (LRP5) probes used in this study were generated from both human 
(HuZmaxl) and mouse (MsZmaxl) cDNAs, which share an 87% sequence identity. The 
homology of human and mouse Zmaxl (LRP5) with rat Zmaxl (LRP5) is unknown. 

For example, gene expression by non-isotopic in situ hybridization was performed as 
10 follows, but other methods would be known to the skilled artisan. Tibias were collected from 
two 6 to 8 week old female Sprague Dawley rats euthanized by carbon dioxide asphyxiation. 
Distal ends were removed and proximal tibias were snap frozen in OCT embedding medium 
with liquid nitrogen immediately following death. Tissues were stored in a -80 °C freezer. 
Probes for amplifying PCR products from cDNA were prepared as follows. The 
15 primers to amplify PCR products from a cDNA clone were chosen using published sequences 
of both human LRP5 (GenBank Accession No. AB017498) and mouse LRP5 (GenBank 
Accession No. AF064984). In order to minimize cross reactivity with other genes in the LDL 
receptor family, the PCR products were derived from an intracellular portion of the protein 
coding region. PCR was performed in a 50 \xl reaction volume using cDNA clone as 
20 template. PCR reactions contained 1 .5 mM MgCl 2 , 1 unit AmpliTaq, 200 yM dNTPs and 2 
jiM each primer. PCR cycling conditions were 94°C for 1 min., followed by 35 cycles of 
94°C for 30 seconds, 55 °C for 30 seconds, 72°C for 30 seconds; followed by a 5 minute 
extension at 72 °C. The reactions were then run on a 1.5% agarose Tris-Acetate gel. DNA 
was eluted from the agarose, ethanol precipitated and resuspended in 10 mM Tris, pH 8.0. 
25 Gel purified PCR products were prepared for both mouse and human cDNAs and supplied to 
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Pathology Associates International for in situ hybridizations. 

The sequence of the human and mouse PCR primers and products were as follows: 
Human Zmaxl a,RPT> sense, prim er rHRM?.S^) 

5'-CCCGTGTGCTCCGCCGCCCAGTTC-3' (SEQ ID NO:633) 
Human Zmaxl rt RES) anrisenge p rimer (HTtM4rtS) 

5'-GGCTCACGGAGCTCATCATGGACTT-3' (SEQ ID NO:634) 
Human Zmaxl PCR prnHnnt 

5*-CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCT 

GCGCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGG 

ACTGTGACGCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGT 

CCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCTCCGACGAG 

CTCATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGT 

GCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCATGGGTGGTGTCTA 

TTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGGGGCCAACGGGCCCTTC 

CCGCACGAGTATGTCAGCGGGACCCCGCACGTGCCCCTCAATTTCATAGCCCCGG 

GCGGTTCCCAGCATGGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGATGA 
GCTCCGTGAGCC-3* (SEQ ID NO:635) 

Mouse Zmaxl (T,RP5-) Sense primer (HBM655) 

S'-AGCGAGGCCACCATCCACAGG-S' (SEQ ID NO:636) 
Mouse Zmaxl fLRP5) antisense pri mer (PreM65fi) 

5-TCGCTGGTCGGC ATAATC AAT-3 ' (SEQ ID NO:637) 
Mouse tt,RP5^ 1 PCR product 

5'-AGCAGAGCCACCATCCACAGGATCTCCCTGGAGACTAACAACAACGATGTGG 
CTATCCCACTCACGGGTGTCAAAGAGGCCTCTGCACTGGACTTTGATGTGTCCAA 
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CAATCACATCTACTGGACTGATGTTAGCCTCAAGACGATCAGCCGAGCCTTCATG 
AATGGGAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGACTACCCTGAAGGA 
ATGGCTGTGGACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACCAAC 
AGGATTGAGGTGGCCCGGCTGGATGGGCAGTTCCGGCAGGTGCTTGTGTGGAGA 
5 GACCTTGACAACCCCAGGTCTCTGGCTCTGGATCCTACTAAAGGCTACATCTACT 
GGACTGAGTGGGGTGGCAAGCCAAGGATTGTGCGGGCCTTCATGGATGGGACCA 
ATTGTATGACACTGGTAGACAAGGTGGGCCGGGCCAACGACCTCACCATTGATT 
ATGCCGACCAGCGA-3' (SEQ ID NO:638) 

Riboprobes were synthesized as follows. The PCR products were reamplified with 

10 chimeric primers designed to incorporate either a T3 promoter upstream, or a T7 promoter 
downstream of the reamplification products. The resulting PCR products were used as 
template to synthesize digoxigenin-labeled riboprobes by in vitro transcription (TVT). 
Antisense and sense riboprobes were synthesized using T7 and T3 RNA polymerases, 
respectively, in the presence of digoxigenin-1 1-UTP (Boehringer-Mannheim) using a 

1 5 MAXlscript IVT kit (Ambion) according to the manufacturer. The DNA was then degraded 
with Dnase-1, and unincorporated digoxigenin was removed by ultrafiltration. Riboprobe 
integrity was assessed by electrophoresis through a denaturing polyacrylamide gel. 
Molecular size was compared with the electrophoretic mobility of a 100-1000 base pair (bp) 
RNA ladder (Ambion). Probe yield and labeling was evaluated by blot immunochemistry. 

20 Riboprobes were stored in 5 aliquots at -80 °C. 

The in situ hybridization was performed as follows. Frozen rat bone was cut into 5 
\iM sections on a Jung CM3000 cryostat (Leica) and mounted on adhesive slides 
(Instrumedics). Sections were kept in the cryostat at -20°C until all the slides were prepared 
in order to prevent mRNA degradation prior to post-fixation for 15 minutes in 4% 

25 paraformaldehyde. Following post-fixation, sections were incubated with 1 ng/nl of either 
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antisense or sense riboprobe in Pathology Associates International (PAI) customized 
hybridization buffer for approximately 40 hours at 58 °C. Following hybridization, slides 
were subjected to a series of post-hybridization stringency washes to reduce nonspecific 
probe binding. Hybridization was visualized by immunohistochemistry with an anti- 
5 digoxigenin antibody (FAB fragment) conjugated to alkaline phosphatase. Nitroblue 

tetrazolium chloride/bromochloroindolyl phosphate (Boehrmger-Mannheim), a precipitating 
alkaline phosphatase substrate, was used as the chromogen to stain hybridizing cells purple to 
nearly black, depending on the degree of staining. Tissue sections were counter-stained with 
nuclear fast red. Assay controls included omission of the probe, omission of probe and anti- 

10 digoxigenin antibody. 

Specific cell types were assessed for demonstration of hybridization with antisense 
probes by visualizing a purple to black cytoplasmic and/or peri-nuclear staining indicating a 
positive hybridization signal for mRNA. Each cell type was compared to the replicate 
sections, which were hybridized with the respective sense probe. Results were considered 

15 positive if staining was observed with the antisense probe and no staining or weak 
background with the sense probe. 

The cellular localization of the hybridization signal for each of the study probes is 
summarized in Table 9. Hybridization for Zmaxl (LRP5) was primarily detected in areas of 
bone involved in remodeling, including the endosteum and trabecular bone within the 

20 metaphysis. Hybridization in selected bone lining cells of the periosteum and epiphysis were 
also observed. Positive signal was also noted in chondrocytes within the growth plate, 
particularly in the proliferating chondrocytes. See Figs. 10, 1 1 and 12 for representative 
photomicrographs of in situ hybridization results. 
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TABLE 9 



10 



15 



20 



25 



30 



35 



PROBE 


SITE 


TSH SIQN AJL 


Hu Zmaxl 


Etrinhvsis 
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Osteoclasts 






Growth Plate 
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Ostpnhlfl^ts 
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Osteoplasts; 






Growth Plate 






resting chondrocytes 






proliferating chondrocytes 


+ 




hypertrophic chondrocytes 






Metaphysis 






osteoblasts 


+ j 




osteoclasts 


+ 




Diaphysis 






Endosteum 






osteoblasts 


+ 




osteoclasts 


+ 




Periosteum 


+ 









Legend: "+" = hybridization signal detected = no hybridization signal detected 
"ISH" - In situ hybridization 



40 



These studies confirm the positional expression of Zmaxl (LRP5) in cells involved in 
bone remodeling and bone formation. Zmaxl (LRP5) expression in the zone of proliferation 
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and in the osteoblasts and osteoclasts of the proximal metaphysis, suggests that the Zmaxl 
(LRP5) gene is involved in the process of bone growth and mineralization. The activity and 
differentiation of osteoblasts and osteoclasts are closely coordinated during development as 
bone is formed and during growth as well as in adult life as bone undergoes continuous 
remodeling. The formation of internal bone structures and bone remodeling result from the 
coupling of bone resorption by activated osteoclasts with subsequent deposition of new 
material by osteoblasts. Zmaxl (LRP5) is related to the LDL receptor gene, and thus may be 
a receptor involved in mechanosensation and subsequent signaling in the process of bone 
remodeling. Therefore, changes in the level of expression of this gene could impact on the 
rate of remodeling and degree of mineralization of bone. 

15. Antisense 

Antisense oligonucleotides are short synthetic nucleic acids that contain 
complementary base sequences to a targeted RNA. Hybridization of the RNA in living cells 
with the antisense oligonucleotide interferes with RNA function and ultimately blocks protein 
expression. Therefore, any gene for which the partial sequence is known can be targeted by 
an antisense oligonucleotide. 

Antisense technology is becoming a widely used research tool and will play an 
increasingly important role in the validation and elucidation of therapeutic targets identified 
by genomic sequencing efforts. 

Antisense technology was developed to inhibit gene expression by utilizing an 
oligonucleotide complementary to the mRNA that encodes the target gene. There are several 
possible mechanisms for the inhibitory effects of antisense oligonucleotides. Among them, 
degradation of mRNA by RNase H is considered to be the major mechanism of inhibition of 
protein function. This technique was originally used to elucidate the function of a target 
gene, but may also have therapeutic applications, provided it is designed carefully and 
properly. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed 
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using chemical synthesis and enzymatic ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically 
synthesized using naturally occurring nucleotides or variously modified nucleotides designed 
to increase the biological stability of the molecules or to increase the physical stability of the 
5 duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. Examples of modified 
nucleotides which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5- 
bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaininomethyl-2-thiouridine, 5- 

10 carboxymethlaminomethyluracil, dihydrouracil, P-D-galactosylqueosine, inosine, N6- 
isopentenyladenine, I-methylguanine, I-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 

15 isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5 -methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil -5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), t-methyl-2-thioxiracil, 3-(3-amino-3- 
N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

In addition, the use of morpholino oligonucleotides could be employed. Morpholinos 

20 are oligomers with modification of the ribose moiety to a morpholino group. This technology 
is covered by U.S. Patent No. 5,185,444 and is described in Summerton et ah, 1997 Antisense 
Nucleic Acid Drug Dev. 7(3): 1 87-95 . The antisense nucleic acid molecules of the invention 
are typically administered to a subject or generated in situ such that they hybridize with or 
bind to cellular mRNA and/or genomic DNA encoding an HBM or Zmaxl (LRP5) protein or 

25 a protein which interacts with Zmaxl (LRP5) and/or HBM to thereby inhibit expression of 
the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case 
of an antisense nucleic acid molecule which binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of administration 
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of an antisense nucleic acid molecule of the invention includes direct injection at a tissue site. 
Alternatively, an antisense nucleic acid molecule can be modified to target selected cells and 
then administered systemically. For example, for systemic administration, an antisense 
molecule can be modified such that it specifically binds to a receptor or an antigen expressed 
5 on a selected cell surface, e.g., by linking the antisense nucleic acid molecule to a peptide or 
an antibody which binds to a cell surface receptor or antigen. The antisense nucleic acid 
molecule can also be delivered to cells using the vectors described herein. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
a-anomeric nucleic acid molecule. An n-anomeric nucleic acid molecule forms specific 

10 double-stranded hybrids with complementary RNA in which, contrary to the usual y-units, 
the strands run parallel to each other (Gaultier et al, 1987 Nucl Acids Res. 15: 6625-41). 
The antisense nucleic acid molecule can also comprise a 2 f -o-methylribonucleotide (Inoue et 
al., 1987 Nucl Acids Res. 15: 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al., 
1987 FEBS Lett. 215: 327-330). In still another embodiment, an antisense nucleic acid of the 

15 invention is a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity 
which are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which 
they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff et al, 1988 Nature 33 4: 585-591) can be used to catalytically cleave 
Zmaxl (LRP5) or HBM mRNA transcripts to thereby inhibit translation of Zmaxl (LRP5) or 

20 HBM mRNA. A ribozyme having specificity for a Zmaxl - or HBM-encoding nucleic acid 
can be designed based upon the nucleotide sequence of a Zmaxl (LRP5) or HBM cDNA 
disclosed herein (i.e., SEQ ID NOS:l or 3). For example, a derivative of a Tetrahymena L-19 
IVSRNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in an HBM or Zmaxl -encoding 

25 mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 and Cech et al. U.S. Patent No. 

5,1 16,742 both incorporated herein by reference. Alternatively, Zmaxl (LRP5) or HBM 
mRNA can be used to select a catalytic RNA having a specific ribonuclease activity from a 
pool of RNA molecules. See, e.g., Bartel et al, 1993 Science 261: 141 1-1418. Alternatively, 
Zmaxl (LRPS) or HBM gene expression can be inhibited by targeting nucleotide sequences 
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complementary to the regulatory region of the Zmaxl (LRPS)or HBM gene (e.g., the Zmaxl 
or HBM gene promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the Zmaxl (LRP5) or HBM genes in target cells. See generally, Helene, 1991 
Anticancer Drug Des. 6(6):569-84; Helene et al, 1992 Ann. NY. Acad Set 660:27-36; and 
5 Maher, 1992 Bioassays 14(12):807-15. 

Zmaxl (LRP5), LRP6, HBM-like and HBM gene expression can also be inhibited 
using RNA interference (RNAi) caused by small inhibitory RNAs (siRNAs). This is a 
technique for post-transcriptional gene silencing (PTGS), in which target gene activity is 
specifically abolished with cognate double-stranded RNA (dsRNA). siRNAs resemble in 

10 many aspects PTGS in plants and has been detected in many invertebrates including 

trypanosome, hydra, planaria, nematode and fruit fly (Drosophila melanogaster). It may be 
involved in the modulation of transposable element mobilization and antiviral state 
formation. RNAi in mammalian systems is disclosed in PCT application WO 00/63364 
which is incorporated by reference herein in its entirety. See also, Elbashir et aL, 2001 

15 Nature 411: 494-98. Basically, dsRNA, homologous to the target (Zmaxl or HBM) is 
introduced into the cell and a sequence specific reduction in gene activity is observed. 

Another embodiment contemplates the use of small hairpin RNAs (shRNAs). These 
compounds are described further in Yu et al, 2002 Proc. Natl Acad. Set USA 99: 6047-52; 
and Paddison et aL 9 2002 Genes & Devel 16: 948-58. 

20 As an example, preparing antisense oligonucleotides can be performed as follows. 

Studies have been undertaken using antisense technology in the osteoblast-like murine cell 
line, MC3T3. These cells can be triggered to develop along the bone differentiation 
sequence. An initial proliferation period is characterized by minimal expression of 
differentiation markers and initial synthesis of collagenous extracellular matrix. Collagen 

25 matrix synthesis is required for subsequent induction of differentiation markers. Once the 
matrix synthesis begins, osteoblast marker genes are activated in a clear temporal sequence: 
alkaline phosphatase is induced at early times while bone sialoprotein and osteocalcin appear 
later in the differentiation process. This temporal sequence of gene expression is useful in 
monitoring the maturation and mineralization process. Matrix mineralization, which does not 
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begin until several days after maturation has started, involves deposition of mineral on and 
within collagen fibrils deep within the matrix near the cell layer-culture plate interface. The 
collagen fibril-associated mineral formed by cultured osteoblasts resembles that found in 
woven bone in vivo and therefore is used frequently as a study reagent. 

MC3T3 cells were transfected with antisense oligonucleotides for the first week of the 
differentiation, according to the manufacturer's specifications (U.S. Patent No. 5,849,902). 
The oligonucleotides designed for Zmaxl (LRP5) are given below: 
10875: 5 f -AGUACAGCUUCUUGCCAACCCAGUC-3 , (SEQ ID NO:639) 
10876: 5 f -UCCUCCAGGUCGAUGGUCAGCCCAU-3' (SEQ ID NO:640) 
10877: 5 , -GUCUGAGUCCGAGUUCAAAUCCAGG-3 t (SEQ ID NO:641) 
Fig. 13 shows the results of antisense inhibition of (LRP5) 1 in MC3T3 cells. The 
three oligonucleotides shown above were transfected into MC3T3 and RNA was isolated 
according to standard procedures. Northern analysis clearly shows markedly lower steady 
state levels of the Zmaxl (LRP5) transcript while the control gene GAPDH remained 
unchanged. Thus, antisense technology using the primers described above allows for the 
study of the role of Zmaxl (LRP5) expression on bone biology. 

16. Yeast Two Hybrid 

In order to identify the signaling pathway that Zmaxl (LRP5) participates in to 
modulate bone density, the yeast two hybrid protein interaction technology was utilized. 
This technique facilitates the identification of proteins that interact with one another by 
coupling tester proteins to components of a yeast transcription system (Fields et al, 1989 
Nature 340: 245-246; U.S. Pat. No. 5,283,173 by Fields et al; Johnston, 1987 Microbiol 
Rev. 51: 458-476; Keegan et al. 9 1986 Science 231: 699-704; Durfee et al, 1993, Genes Dev. 
7: 555-569; Chien et al, 1991 Proc. Natl Acad. Sci USA 88: 9578-9582; Fields et al, 1994 
Trends in Genetics 10: 286-292; and Gyuris et al, 1993 Cell 75: 791-803). First a "bait" 
protein, the protein for which one seeks interacting proteins, is fused to the DNA binding 
domain of a yeast transcription factor. Second, a cDNA library is constructed that contains 
cDNAs fused to the transcriptional activation domain of the same yeast transcription factor; 

102 



BNSDOCID: <WO 02092000A2J_> 



WO 02/092000 PCT/US02/14877 



this is termed the prey library. The bait construct and prey library are transformed into yeast 
cells and then mated to produce diploid cells. If the bait interacts with a specific prey from 
the cDNA library, the activation domain is brought into the vicinity of the promoter via this 
interaction. Transcription is then driven through selectable marker genes and growth on 
5 selective media indicates the presence of interacting proteins. 

The amino acid sequence used in the yeast two hybrid experiments discussed herein 
consisted of the entire cytoplasmic domain and a portion of the transmembrane domain and is 
shown below (amino to carboxy orientation): 

10 RWCQRYAGA NGPFPHEYVS GTPHVPLNFI APGGSQHGPF TGIACGKSMM 
SSVSLMGGRG GVPLYDRNHV TGASSSSSSS TKATLYPPT L, KTPPPSPA TDP 
SLYNMDMFYS SNIPATVRPY RPYIIRG MAP PTTP CSTDVC DSDYSASRWK 
AS KY YLDLNS DSDP YPPPPT EH SQYLSAE D S CP PS PA TER SYFHL£LEEEE 
£E£TDSS (SEQ ID NO: 765) 

15 

The last 6 amino acids of the putative transmembrane domain are indicated in bold. 
Putative SH3 domains are underlined. Additional amino acid sequences of 50 amino acids or 
greater in either the proteins encoded by the Zmaxl (LRP5) or HBM alleles can also be used 
as bait. The upper size of the polypeptide used as bait is limited only by the presence of a 

20 complete transmembrane domain (see Fig. 4), which will render the bait to be nonfunctional 
in a yeast two hybrid system. These additional bait proteins can be used to identify additional 
proteins which interact with the proteins encoded by HBM or Zmaxl (LRP5) in the focal 
adhesion signaling pathway or in other pathways in which these HBM or Zmaxl (LRP5) 
proteins may act. Once identified, methods of identifying agents which regulate the proteins 

25 in the focal adhesion signaling pathway or other pathways in which HBM acts can be 
performed as described herein for the HBM and Zmaxl (LRP5) proteins. 

In order to identify cytoplasmic Zmaxl (LRP5) signaling pathways, the cytoplasmic 
domain of Zmaxl (LRP5) was subcloned into two bait vectors. The first vector was pDBleu, 
which was used to screen a brain, and Hela prey cDNA library cloned into the vector pPC86 

30 (Clontech). The second bait vector used was pDBtrp, which was used to screen a cDNA prey 
library derived from the TE85 osteosarcoma cell line in vector pOP46. Another suitable 
vector which is widely available, is p86 (Gibco, iest™ System). Standard techniques known 
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to those skilled in the art were used as described in Fields and Song, 1989 Nature 340: 245- 
246; U.S. Pat No. 5,283,173 by Fields and Song; Johnston, 1987 Microbiol Rev. 51: 458- 
476; Keegan et al, 1986 Science 231: 699-704; Durfee etal, 1993 Genes Dev. 7: 555-569; 
Chien et ah, 1991 Proc. Natl Acad. Sci USA 88: 9578-9582; Fields et aL 9 1994 Trends in 
5 Genetics 10: 286-292; and Gyuris et aL, 1993 Cell 75: 791-803. The bait construct and prey 
cDNA libraries were transformed into yeast using standard procedures. 

To perform the protein interaction screen, an overnight culture of the bait yeast strain 
was grown in 20 ml SD selective medium with 2% glucose (pDBLeu, SD -Leu medium, 
pDBtrp, SD -trp medium). The cultures were shaken vigorously at 30 °C overnight. The 
1 0 cultures were diluted 1:10 with complete medium (YEPD with 2% glucose) and the cultures 
then incubated with shaking for 2 hrs at 30 °C. 

The frozen prey library was thawed, and the yeast cells reactivated by growing them 
in 150 ml YEPD medium with 2% glucose for 2 hrs at 30°C. A filter unit was sterilized with 
70% ethanol and washed with sterile water to remove the ethanol. The cell densities of both 
15 bait and prey cultures were measured by determining the OD at 600 nm. An appropriate 

volume of yeast cells that corresponded to a cell number of 1 ml of OD 600 = 4 of each yeast 
strain, bait and prey (library) was placed in a 50 ml Falcon tube. The mixture was then 
filtered through the sterilized filter unit. The filter was then transferred onto a prewanned 
YEPD agar plate with the cell side up, removing all air bubbles underneath the filter. Plates 
20 were then incubated at 30°C for 6 hrs. One filter was transferred into a 50 ml Falcon tube, 
and 10 ml of SD with 2% Glucose was added; cells were resuspended by vortexing for 10 
sec. 

The number of primary diploid cells (growth on SD -Leu, -Tip plates) versus the 
numbers of colony forming units growing on SD -Trp and SD -Leu plates only was then 
25 titered. Different dilutions were plated and incubated at 30°C for two days. The number of 
colony forming units was then counted. The number of diploid colonies (colonies on SD - 
Leu -Trp plates) permits the calculation of whether or not the whole library of prey constructs 
was mated to the yeast expressing the bait. This information is important to judge the quality 
of the screen. 
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16.1 Indirect selection 

Resuspended cells from 5 filter-matings were then pooled and the cells sedimented by 
centrifiigation in a 50 ml Falcon tube. Cells were then resuspended in 16 ml SD medium 
with 2% Glc. Two ml of this cell suspension was plated onto 8 square plates each (SD -Leu, - 
5 Tip) with sterile glass beads and selected for diploid cells by incubating at 30° C for 18 - 20 
hrs. 

Cells were then scraped off the square plates, the cells centrifiiged and combined into 
one 50 ml Falcon tube. The cell pellet was then resuspended in 25 ml of SD medium with 
2% glucose. The cell number was then determined by counting of an appropriate dilution 

1 0 (usually 1 : 1 00 to 1 : 1 000) with a Neugebauer chamber. Approximately 5 x 1 0 7 diploid cells 
were plated onto the selective medium. The observations about the growth of the bait strain 
together with irrelevant prey vectors helps to determine which selective plates will have to be 
chosen for the library screen. Generally, all screens were plated on one square plate each 
with SD -Leu, -Trp, -His; SD -Leu, -Trp, His, 5 mM 3AT, and SD -Leu, -Trp, -His, -Ade is 

15 recommended. 

The yeast cells were then spread homogeneously with sterile glass beads and 
incubated at 30°C for 4 days. The number of colony forming yeast cells was titered by 
plating different dilutions of the scraped cell suspension onto SD -Leu, -Trp plates. Usually, 
plating of 100 jxl of a 10" 3 and 10 -4 dilution gave 100 - 1000 colonies per plate. 

20 

16.2 Direct selection 

Five filters with the mated yeast cells were each transferred into separate 50 ml 
Falcon tubes and the cells resuspended with 10 ml SD medium with 2% Glc, each, followed 
by vortexing for 10 sec. The resuspended cells were combined and centrifiiged in a Beckman 
25 centrifuge at 3000 rpm. The supernatant was discarded and the cells resuspended in 6 ml of 
SD medium with 2% Glc. Two ml of the suspension was spread onto each selective square 
plate and incubated at 30° C for 4 - 5 days. 
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16.3 Isolation of Single Colonic 

Yeast cells from an isolated colony were picked with a sterile tooth pick and 
transferred into individual wells of a 96 well plate. The cells were resuspended in 50 [il of SD 
-Leu, -Trp, -His medium and incubated at 30°C for one day. The yeast cells were then 
stamped onto a SD -Leu, -Tip, -His plate in 96 well format and incubated at 30 °C for 2 days. 
Yeast cells were also stamped onto a Nylon filter covering a YEPD plate and incubated at 
30°C for one day. The cells on the Nylon filter were used for the analysis of the R - Gal 
reporter activity. 

Yeast colonies were scraped from the SD -Leu, -Trp, -His plate with a sterile tooth 
pick, and reconfigured, if necessary, according to the fi-Gal activity and then resuspended in 
20 % glycerol. This served as a master plate for storage at -80 °C. 

For DNA preparation, yeast cells from the glycerol stock were stamped onto a SD - 
Trp plate and incubated at 30°C for 2 days. After two days of incubation, the yeast colonies 
were ready for colony PCR and sequencing. Standard colony PCR conditions were used to 
amplify inserts from preys recovered from the interaction screen. Sequencing was done 
using standard sequencing reactions and ABI377 (Perkin Elmer) fluorescent sequencing 
machines. 

16.4 Verification of bait/pre y interaction 

Glycerol stocks of the prey of interest were thawed and inoculated in a 10 ml 
overnight culture of SD with glucose -Trp. After overnight growth, plasmid DNA 
preparation was performed using the BIO 101 RPM Yeast Plasmid Isolation Kit with 10 ml 
of culture. The culture was centrifuged and transferred to a 1.5 ml microcentrifuge tube. 
Yeast Lysis Matrix was then added to the pellet followed by 250 jil of Alkaline Lysis 
Solution. Samples were then vortexed for 5 minutes. 250 jxl Neutralizing Solution was 
added and the sample mixed briefly. Samples were centrifuged for 2 minutes at room 
temperature in a microcentrifuge. The supernatant was transferred to a Spin Filter avoiding 

debris and Lysis Matrix. 250 jil of Glassmilk Spin Buffer was added, and the tubes inverted 

< 

to mix. Samples were centrifuged for 1 min and the liquid in the Catch Tube was discarded. 
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500 ^1 of Wash Solution was added, the samples were centrifuged for 1 min, and the wash 
solution was discarded. The wash step was repeated once followed by a 1 min dry 
centrifugation to drive the remaining liquid out of the Spin Filter. The filter was transferred 
to a new Catch Tube and 100 \x\ of sterile H 2 0 was added; samples were then vortexed briefly 
5 to resuspend and centrifuged for 30 seconds to collect the DNA in the bottom of the Catch 
Tube. 

Five nl of DNA was then transformed into DH10B Electrornax cells using standard 
procedures and glycerol stocks prepared. Miniprep DNA was prepared using the Qiagen 
QIAprep Spin Miniprep Kit. DNA was finally eluted with 30 jal of Qiagen EB buffer. One 
10 (ill of the plasmid DNA samples was then used to transform yeast cells using standard 

procedures. After 2 days of growth on SD -trp media, colonies were picked and patched onto 
fresh media. Similarly, bait colonies were patched onto SD -Leu media. Both were grown 
overnight at 30°C. 

For mating, cells from bait and prey patches were spread together on YAPD media 
15 and incubated at 30°C for 12 hr. This plate was then replica plated onto an SD Agar-Leu-Trp 
plate and grown for 2 days at 30 °C. To test the strength of interaction these plates were 
replica plated onto SD Agar-Leu-Trp-His, SD Agar-Leu-Trp-His with 5 mM 3 AT and 10 
mM 3 AT, SD Agar-Leu-Trp-His- Ade, and SD Agar-Leu-Trp-Ura media and grown for 2 
daysat30°C. 

20 

16.5 Galacton Star fi-Galactosidase Activity Assay 

After streaking and replica plating positive interactors on selection plates, colonies 
were placed in a 96 well dish with 200 \il of SD-medium, leaving wells 1 and 96 blank. Ten 
microliters from the first 96 well dish was plated into another flat bottom 96 well dish 
25 containing 100 \il of SD-medium. Controls consisted of a negative control and a very weak 
positive control. The cell density was measured at OD 600 (a value of 1 corresponds to lxl 0 7 
cells utilizing a 96 well spectrophotometer). The OD was usually between 0.03 and 0.10. 
Using microplates specifically for the luminometer, 50 ill of reaction mixture were pipetted 
into each well. Fifty microliters of culture were then added and mixed by pipetting up and 
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down twice. The reaction was incubated for 30 minutes at room temperature followed by 
measurement of Relative Light Units using a luminometer. 

Table 9 lists the genes identified in the yeast two hybrid screens from the 3 prey 
libraries tested. Two genes, 2yxin and axin, were found to interact with the cytoplasmic 
5 domain of (LRP5) 1 in all three screens. Three genes, alpha-actinin, TCB and SI -5 interacted 
in two of the three screens. 

A variety of proteins found at sites of cell-cell and cell-matrix contact (focal 
contacts/adesion plaques) were shown to interact with the cytoplasmic domain of Zmaxl 
(LRP5). These include a-actinin, Trio, Pinch-like protein, and Zyxin. PINCH is a LIM 
10 domain-containing protein that is known to interact with integrin-linked kinase, an early 
signaler in integrin and growth factor signaling pathways. The finding of a closely related 
gene in the yeast two hybrid screen raises the possibility of a novel pathway linked to integrin 
signaling from extracellular matrix signals. Trio, also known to localize to focal adhesions, is 
thought to play a key role in coordinating cell-matrix interactions and cytoskeletal 
15 rearrangements involved in cell movement. Zyxin, another LIM domam-containing protein, 
is also localized to adhesion plaques and is thought to be involved in reorganization of the 
cytoskeleton when triggers are transmitted via integrin signaling pathways. Zyxin also 
interacts with alpha actinin, which we identified as interacting with Zmaxl (LRP5). Other 
LEVI domain containing proteins identified include the human homologue of mouse ajuba, 
20 LIMD1, and a novel LIMDl-like protein. 

Axin was also identified from the two hybrid experiments. This protein is involved in 
inhibition of the Wnt signaling pathway and interacts with the tumor suppressor APC. There 
is a link here with the focal adhesion signaling described above: one common step in the two 
pathways involves inhibition of glycogen synthase kinase 3, which in turn results in the 
25 activation of B-catenin/Lef-1 and AP-1 transcription factors. Axin/APC are involved in this 
as well as integrin linked kinase. The Wnt pathway has a role in deterniining cell fates during 
embryogenesis. If inappropriately activated, the Wnt pathway may also lead to cancer. The 
Wnt pathway also seems to have a role in cytoskeletal rearrangements. In eiXettopus embryo 
assay, the combination of HBM and Wnt5a proteins stimulated the Wnt pathway to a much 
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greater extent than the combination of Zmaxl (LRP5) and Wnt5a, which was modestly above 
the control and Wnt5a alone scores. The HBM and Zmaxl (LRP5) extracellular domains 
(ECD) caused a modest stimulation of Wht signaling in the absence of Wnt5a which was 
slightly increased by the presence of Wnt5a in the presence of HBM ECD. A model 
5 depicting Zmaxl (LRP5) involvement in focal adhesion signaling is depicted in Fig. 15. 

This data coupled with other studies suggest that integrin signaling pathways have a 
role in cellular responses to mechanical stress and adhesion. This provides an attractive 
model for the mechamsm of action of Zmaxl (LRP5) in bone biology. It is possible that 
Zmaxl (LRP5) is involved in sensing either mechanical stress directly or binding a molecule 
10 in the extracellular matrix that is related to mechanical sensation. Signaling through 

subsequent pathways may be involved in bone remodeling due to effects on cell morphology, 
cell adhesion, migration, proliferation, differentiation, and apoptosis in bone cells. 



TABLE 10: Yeast Two Hybrid Results 



Gene 
Symbol 


Gene 


Genbank 
Accession # 


NT 
SEQID 
NO: 


AA 
SEQID 
NO: 


ACTN1 


alpha-actinin 


NM 001102 


642 




AES 


ammo-terminal enhancer of 


NM 001130.3 


643 




AIP4 


atrophia- 1 interacting protein 


AF038564.1 


644 




Novel 


Ajuba 




645 




AXIN 


Wnt signaling 


AF009674.1 


646 




CDC23 


cell division cycle 23, yeast, 
homolog 


NM_004661.1 


647 




HSM800944 


Similar to TRIO 


AL1 17435.1 


748 




HSM800936 




AL1 17427.1 


649 




Novel 


Similar to LIM domains 
containing protein 1 




650 




DEEPEST 


mitotic spindle coiled-coil 
related protein 


NM_006461.1 


651 




ECM1 


extracellular matrix protein 1 


U65932.1 


652 




EF1A 


elongation factor 1 -alpha 


X16869.1 


653 




FN 


fibronectin 


X02761.1 


654 
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Svmbol 


bene 


Genbank 

ACt^MULl rr 


NT 
NO: 


AA 
NO: 




homeodomain protein 


T TO 1 c r\c\ -i 

US 1599.1 


655 




i\iovei 


Glu-Lys Rich protein 




656 




t TA/mi 

J^JUVLLJ 1 


LIM domains containing 1 


NM 014240.1 


657 




Novel 


PINCH-like 




658 






centrosomal protein 


NM 005493.1 


659 




olO 


extracellular protein 


U03 877.1 


660 






gene encoding cytosolic 

LULjfUJlU. llOllXLUIlC-UlXLUJJlg 


M26252.1 


661 




TTT* 
I JLL* 


tumorous imaginal discs 


NM _005 147.1 


662 




I A 


Zyxin 


NM 003461.1 


663 




TRIO 


GTPase 


U42390.1 


664 




ttt TA/TDTT'DT^ 

JrlUiVLri. I re 


phosphatidylinositol transfer 


D30037.1 


665 




ACTNl 


alpha-actinin 


NP 001093.1 




666 




ammo-terminal enhancer of 


NP 001121.2 




667 


AJP4 


atrophin-1 interacting protein 


AAC04845.1 




668 


Novel 


Ajuba 






669 


A "VTKT 


Wnt signaling 


AAC5 1624.1 




670 





cell division cycle 23, yeast 

tirim r\1 or 


NP_004652. 1 




671 


Novel 


O J 1 1 a. _ '"I'll T/~\ /""I A T"> ^ ^ 4 

Similar to TRIO CAB55923.1 






672 


Novel 


Similar to LIM domains 
i>u±i Ldimiig proicin 1 






673 




mitotic spindle coiled-coil 
icicticu protein 


NP_006452.1 




674 




cALid-uciiuicu matrix proiein 1 






675 


EF1A 


elongation factor 1 -alpha 


CAA34756.1 




676 


FN 


fibronectin 


CAA26536.1 




677 


Novel 


Glu-Lys rich protein 






678 


HOXB13 


homeodomain protein B13 


AAB39863.1 




679 


LIMD1 


LIM domains containing 1 


NP 055055.1 




680 


Novel 


PINCH-like 






681 


RANBPM 


centrosomal protein 


NP 005484.1 




682 


Sl-5 


extracellular protein 


AAA65590.1 




683 
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Symbol 


Gene 


Genbank 
Accession # 


iNT 
SEQ ID 
NO: 


A A 

AA 
SEQ ID 
NO: 


TCB 


cytosolic thyroid hormone- 
binding protein 


AAA36672.1 




684 


TID 


tumorous imaginal discs 


NP 005138.1 




685 


ZYX 


Zyxin 


NP 003452.1 




686 


TRIO 


GTPase 


AAC34245.1 




687 


PTDINSTP 


phosphatidylinositol transfer 
protein beta isoform 


P48739 




688 



In light of the model depicted in Fig. 15 and the results shown in Table 10, another 
aspect contemplated by the invention would be to regulate bone density and bone mass 
disorders by the regulating focal adhesion signaling. The regulation can occur by regulating 

10 the DNA, mRNA transcript or protein encoded by any of the members involved in the focal 
adhesion signaling pathway as identified by the yeast two hybrid system. 

Also contemplated are the novel nucleic acids and proteins identified by the HBM 
yeast two hybrid system. These include but are not limited to SEQ ID NO: 645 (Ajuba), SEQ 
ID NO: 651 (a gene similar to a gene encoding LIM domains containing protein 1), SEQ ID 

15 NO: 656 (Glu-Lys Rich protein), SEQ ID NO: 658 (PINCH-like gene), SEQ ID NO: 669 

(Ajuba protein), SEQ ID NO: 672 (protein similar to TRIO), SEQ ID NO: 673, SEQ ID NO: 
678 (Glu-Lys rich protein) and SEQ ID NO: 681 (PINCH-like protein). 

17. Potential Function 

20 The protein encoded by Zmaxl (LRP5) and LRP6 are related to the Low Density 

Lipoprotein receptor (LDL receptor). See, Goldstein et al, 1985 Ann. Rev. Cell Biology, 1 : 
1-39; Brown et al, 1986 Science, 232:34-47. The LDL receptor is responsible for uptake of 
low density lipoprotein, a lipid-protein aggregate that includes cholesterol. Individuals with a 
defect in the LDL receptor are deficient in cholesterol removal and tend to develop 

25 artherosclerosis. In addition, cells with a defective LDL receptor show increased production 
of cholesterol, in part because of altered feedback regulation of cholesterol synthetic enzymes 

in 
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and in part because of increased transcription of the genes for these enzymes. In some cell 
types, cholesterol is a precursor for the formation of steroid hormones. 

Thus, the LDL receptor may, directly or indirectly, function as a signal transduction 
protein and may regulate gene expression. Because Zmaxl (LRP5) and LRP6 are related to 
the LDL receptor, this protein may also be involved in signaling between cells in a way that 
affects bone remodeling. 

The glycine 1 71 amino acid is likely to be important for the function of Zmaxl 
(LRP5) because this amino acid is also found in the mouse homologue of Zmaxl (LRP5). 
The closely related LRP6 (Genbank Accession No. JE0272) protein also contains glycine at 
the corresponding position (Brown et al 9 1988 Biochem. Biophys. Res. Comm. 248: 879-88). 
Amino acids that are important in a protein's structure or function tend to be conserved 
between species, because natural selection prevents mutations with altered amino acids at 
important positions from arising. 

In addition, the extracellular domain of Zmaxl (LRP5) contains four repeats 
consisting of five YWTD motifs followed by an EFG motif. This 5YWTD+EGF repeat is 
likely to form a distinct folded protein domain, as this repeat is also found in the LDL 
receptor and other LDL receptor-related proteins. The first three 5 YWTD+EGF repeats are 
very similar in their structure, while the fourth is highly divergent. Glycine 171 occurs in the 
central YWTD motif of the first 5YWTD+EGF repeat in Zmaxl (LRP5). The other two 
similar 5YWTD+EGF repeats of Zmaxl (LRP5) also contain glycine at the corresponding 
position, as does the 5 YWTD+EGF repeat in the LDL receptor protein. However, only 
17.6% of the amino acids are identical among the first three 5YWTD+EGF repeats in Zmaxl 
(LRP5) and the single repeat in the LDL receptor. These observations indicate that glycine 
171 is essential to the function of this repeat, and mutation of glycine 171 causes a functional 
alteration of Zmaxl (LRP5). The cDNA and peptide sequences are shown in Figs. 6A-6J. 
The critical base at nucleotide position 582 is indicated in bold and is underlined. 

Northern blot analysis (Figs. 7A-B) reveals that Zmaxl (LRP5) is expressed in human 
bone tissue as well as numerous other tissues. A multiple-tissue Northern blot (Clontech, 
Palo Alto, CA) was probed with exons from Zmaxl (LRP5). As shown in Fig. 7 A, the 5.5 kb 

112 



) 02092000A2J_> 



WO 02/092000 PCT/US02/14877 



Zmaxl (LRP5) transcript was highly expressed in heart, kidney, lung, liver and pancreas and 
is expressed at lower levels in skeletal muscle and brain. A second northern blot, shown in 
Fig. 7B, confirmed the transcript size at 5.5 kb, and indicated that Zmaxl (LRP5) is 
expressed in bone, bone marrow, calvaria and human osteoblastic cell lines. 
5 Taken together, these results coupled with the yeast two hybrid results indicate that 

the HBM polymorphism in the Zmaxl (LRP5) gene is responsible for the HUM phenotype, 
and that the Zmaxl gene is important in bone development. In addition, because mutation of 
Zmaxl can alter bone mineralization and development, it is likely that molecules that bind to 
Zmaxl may usefully alter bone development. Such molecules may include, for example, 
10 small molecules, proteins, RNA aptamers, peptide aptamers, and the like. 

18. Preparation of Nucleic Acids. Vectors, Transformations and Host Cells 

Large amounts of the nucleic acids of the present invention may be produced by 
replication in a suitable host cell. Natural or synthetic nucleic acid fragments coding for a 

1 5 desired fragment will be incorporated into recombinant nucleic acid constructs, usually DNA * 
constructs, capable of introduction into and replication in a prokaryotic or eukaryotic cell. 
Usually the nucleic acid constructs will be suitable for replication in a unicellular host, such 
as yeast or bacteria, but may also be intended for introduction to (with and without 
integration within the genome) cultured mammalian or plant or other eukaryotic cell lines. 

20 The purification of nucleic acids produced by the methods of the present invention is 

described, for example, in Sambrook et al. 9 Molecular Cloning. A Laboratory Manual,, 2nd 
Ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et aL, 
Current Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). 

The nucleic acids of the present invention may also be produced by chemical 

25 synthesis, e.g., by the phosphoramidite method described by Beaucage et aL, 198 1 Tetra. 

Lett. 22: 1859-62 or the triester method according to Matteucci,e* a/., 1981 Am. Chem. Soc. 
103: 3185, and may be performed on commercial, automated oligonucleotide synthesizers. A 
double-stranded fragment may be obtained from the single-stranded product of chemical 
synthesis either by synthesizing the complementary strand and annealing the strands together 
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under appropriate conditions or by adding the complementary strand using DNA polymerase 
with an appropriate primer sequence. 

Nucleic acid constructs prepared for introduction into a prokaryotic or eukaryotic host 
may comprise a replication system recognized by the host, including the intended nucleic 
acid fragment encoding the desired protein, and will preferably also include transcription and 
translational initiation regulatory sequences operably linked to the protein encoding segment. 
Expression vectors may include, for example, an origin of replication or autonomously 
replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and 
necessary processing information sites, such as ribosome-binding sites, RNA splice sites, 
polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. 
Secretion signals may also be included where appropriate, whether from a native HBM or 
Zmaxl (LRP5) protein or from other receptors or from secreted proteins of the same or 
related species, which allow the protein to cross and/or lodge in cell membranes, and thus 
attain its functional topology, or be secreted from the cell. Such vectors may be prepared by 
means of standard recombinant techniques well known in the art and discussed, for example, 
in Sambrook et al, (1989) or Ausubel et al, (1992). 

An appropriate promoter and other necessary vector sequences will be selected so as 
to be functional in the host, and may include, when appropriate, those naturally associated 
with Zmaxl (LRP5) or HBM genes. Examples of workable combinations of cell lines and 
expression vectors are described in Sambrook et aL, (1989) or Ausubel et aL 9 (1992). Many 
useful vectors are known in the art and may be obtained from such vendors as Stratagene, 
New England BioLabs, Promega Biotech, and others. Promoters such as the trp, lac and 
phage promoters, tRNA promoters and glycolytic enzyme promoters may be used in 
prokaryotic hosts. Useful yeast promoters include promoter regions for metallothionein, 3- 
phosphoglycerate kinase or other glycolytic enzymes such as enolase or glyceraldehyde-3- 
phosphate dehydrogenase, enzymes responsible for maltose and galactose utilization, and 
others. Vectors and promoters suitable for use in yeast expression are further described in EP 
73,675 A. Appropriate non-native mammalian promoters might include the early and late 
promoters from SV40 (Fiers et aL, 1978 Nature 273: 1 13) or promoters derived from murine 
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Moloney leukemia virus, mouse tumor virus, avian sarcoma viruses, adenovirus II, bovine 
papilloma virus or polyoma. In addition, the construct may be joined to an amplifiable gene 
(e.g., DHFR) so that multiple copies of the gene may be made. For appropriate enhancer and 
other expression control sequences, see also Enhancers and Eukaryotic Gene Expression, 
5 Cold Spring Harbor Press, Cold Spring Harbor, NY (1983). 

While such expression vectors may replicate autonomously, they may also replicate 
by being inserted into the genome of the host cell, by methods well known in the art. 

Expression and cloning vectors will likely contain a selectable marker, a gene 
encoding a protein necessary for survival or growth of a host cell transformed with the vector. 

10 The presence of this gene ensures growth of only those host cells which express the inserts. 

Typical selection genes encode proteins that a) confer resistance to antibiotics or other toxic 
substances, e.g. ampicillin, neomycin, methotrexate, etc.; b) complement auxotrophic 
deficiencies, or c) supply critical nutrients not available from complex media, e.g., the gene 
encoding D-alanine racemase for Bacilli. The choice of the proper selectable marker will 

15 depend on the host cell, and appropriate markers for different hosts are well known in the art. ^ 
The vectors containing the nucleic acids of interest can be transcribed in vitro, and the 
resulting RNA introduced into the host cell by well-known methods, e.g., by injection (see, 
Kubo et al. 9 FEES Letts. 241 : 119 (1988)), or the vectors can be introduced directly into host 
cells by methods well known in the art, which vary depending on the type of cellular host, 

20 including electroporation; transfection employing calcium chloride, rubidium chloride, 
calcium phosphate, DEAE-dextran, or other substances; microprojectile bombardment; 
lipofection; infection (where the vector is an infectious agent, such as a retroviral genome); 
and other methods. See generally, Sambrook et ah, 1989 and Ausubel et ah, 1992. The 
introduction of the nucleic acids into the host cell by any method known in the art, including 

25 those described above, will be referred to herein as "transformation." The cells into which 

have been introduced nucleic acids described above are meant to also include the progeny of 
such cells. 

Large quantities of the nucleic acids and proteins of the present invention may be 
prepared by expressing the Zmaxl(LRP5), LRP6, HBM or HBM-like nucleic acids or 
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portions thereof in vectors or other expression vehicles in compatible prokaryotic or 
eukaryotic host cells. The most commonly used prokaryotic hosts are strains of Escherichia 
coli, although other prokaryotes, such as Bacillus subtilis or Pseudomonas may also be used. 

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fungi, 
plant, insect, or amphibian or avian species, may also be useful for production of the proteins 
of the present invention. Propagation of mammalian cells in culture is per se well known. 
See, Jakoby and Pastan (eds.), Cell Culture. Methods in Enzyniology, volume 58, Academic 
Press, Inc., Harcourt Brace Jovanovich, NY, (1979). Examples of commonly used 
mammalian host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cells, 
and WI38, BHK, and COS cell lines, although it will be appreciated by the skilled 
practitioner that other cell lines may be appropriate, e.g., to provide higher expression 
desirable glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. Li prokaryotic hosts, the transformant may be selected, e.g., by 
resistance to ampicillin, tetracycline or other antibiotics. Production of a particular product 
based on temperature sensitivity may also serve as an appropriate marker. 

Prokaryotic or eukaryotic cells transformed with the nucleic acids of the present 
invention will be useful not only for the production of the nucleic acids and proteins of the 
present invention, but also, for example, in studying the characteristics of Zmaxl (LRP5), 
LRP6, HBM or HBM-like proteins. 

Antisense nucleic acid sequences are useful in preventing or diminishing the 
expression of Zmaxl (LRP5), LRP6, HBM or HBM-like molecules, as will be appreciated by 
one skilled in the art. For example, nucleic acid vectors containing all or a portion of the 
Zmaxl (LRP5), LRP6, HBM or HBM-like nucleic acid may be placed under the control of a 
promoter in an antisense orientation and introduced into a cell. Expression of such an 
antisense construct within a cell will interfere with Zmaxl (LRP5), LRP6, HBM or HBM-like 
transcription and/or translation and/or replication. 

The probes and primers based on the Zmaxl (LRP5), LRP6, HBM or HBM-]ike gene 
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sequences disclosed herein are used to identify homologous gene sequences and proteins in 
other species. The gene sequences and proteins can also be used in the diagnostic/prognostic, 
therapeutic and drug screening methods described herein for the species from which they 
have been isolated. 

5 

19. Protein Expression and Purification 

Expression and purification of the Zmaxl (LRP5), LRP6, HBM or HBM-like proteins 
of the invention can be performed essentially as outlined below (LRP5, LRP6 and HBM-like 
proteins are also included when referring only to HBM). To facilitate the cloning, expression 

10 and purification of membrane and secreted protein from the HBM gene, a gene expression 
system, such as the pET System (Novagen), for cloning and expression of recombinant 
proteins in E. coli was selected. Also, a DNA sequence encoding a peptide tag, the His-Tap, 
was fused to the 3 f end of DNA sequences of interest to facilitate purification of the 
recombinant protein products. The 3' end was selected for fusion to avoid alteration of any 5 r 

1 5 terminal signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in SEQ ID NOS: 
1, 3 and 5-12 for cloning HBM were prepared by polymerase chain reaction (PCR). 
Synthetic oligonucleotide primers specific for the 5* and 3* ends of the HBM nucleotide 
sequence were designed and purchased from Life Technologies (Gaithersburg, MD). All 

20 forward primers (specific for the 5' end of the sequence) were designed to include an Ncol 
cloning site at the 5' terminus. These primers were designed to permit initiation of protein 
translation at the methionine residue encoded within the Ncol site followed by a valine 
residue and the protein encoded by the HBM DNA sequence. All reverse primers (specific 
for the 3 ! end of the sequence) included an EcoRI site at the 5 ! terminus to permit cloning of 

25 the HBM sequence into the reading frame of the pET-28b. The pET-28b vector provided a 
sequence encoding an additional 20 carboxyl-terminal amino acids including six histidine 
residues (at the C-terminus), which comprised the histidine affinity tag. 

Genomic DNA prepared from the HBM gene was used as the source of template DNA 
for PCR amplification (Ausubel et al, Current Protocols in Molecular Biology, John Wiley 
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& Sons (1994)). To amplify a DNA sequence containing the HBM nucleotide sequence, 
genomic DNA (50 ng) was introduced into a reaction vial containing 2 mM MgCl 2 , 1 juM 
synthetic oligonucleotide primers (forward and reverse primers) complementary to and 
flanking a defined HBM, 0.2 mM of each of deoxynucleotide triphosphate, dATP, dGTP, 
dCTP, dTTP and 2.5 units of heat stable DNA polymerase (AmpliTaq™, Roche Molecular 
Systems, Inc., Branchburg, NJ) in a final volume of 100 microliters. 

Upon completion of thermal cycling reactions, each sample of amplified DNA was 
purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD). All 
amplified DNA samples were subjected to digestion with the restriction endonucleases, e.g., 
Ncol and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al , Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)). DNA samples were then subjected to 
electrophoresis on 1.0% NuSeive (FMC BioProducts, Rockland, ME) agarose gels. DNA 
was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA 
contained in slices isolated from the agarose gel was purified using the Bio 101 GeneClean 
Kit protocol (Bio 101, Vista, CA). 

The pET-28b vector was prepared for cloning by digestion with restriction 
endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al, 
Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)). The pET-28a 
vector, which encodes the histidine affinity tag that can be fused to the 5' end of an inserted 
gene, was prepared by digestion with appropriate restriction endonucleases. 

Following digestion, DNA inserts were cloned (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)) into the previously digested pET-28b 
expression vector. Products of the ligation reaction were then used to transform the BL21 
strain of E. coli (Ausubel et al, 1994) as described below. 

Competent bacteria, E. coli strain BL21 or E, coli strain BL21 (DE3), were 
transformed with recombinant pET expression plasmids carrying the cloned HBM sequence 
according to standard methods (Ausubel et al, 1994). Briefly, 1 \a\ of ligation reaction was 
mixed with 50 jlxI of electrocompetent cells and subjected to a high voltage pulse, after which 
samples were incubated in 0.45 ml SOC medium (0.5% yeast extract, 2.0% tryptone, 10 mM 
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NaCl, 2.5 mM KC1, 10 mM MgCl 2 , 10 mM MgS0 4 and 20 mM glucose) at 37 °C with 
shaking for 1 hour. Samples were then spread on LB agar plates containing 25 ^ig/ml 
kanamycin sulfate for growth overnight. Transformed colonies of BL21 were then picked 
and analyzed to evaluate cloned inserts, as described below. 
5 Individual BL21 clones transformed with recombinant pET-28b HBM nucleotide 

sequences were analyzed by PCR amplification of the cloned inserts using the same forward 
and reverse primers specific for the HBM sequences that were used in the original PCR 
amplification cloning reactions. Successful amplification verifies the integration of the HBM 
sequence in the expression vector (Ausubel et al. 9 1994). 

10 Individual clones of recombinant pET-28b vectors carrying properly cloned HBM 

nucleotide sequences were picked and incubated in 5 ml of LB broth plus 25 jig/ml 
kanamycin sulfate overnight. The following day plasmid DNA was isolated and purified 
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA). 

The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174, HB101, 

1 5 JM1 09, DH5 and the like, for purposes of cloning or plasmid preparation. Hosts for 

expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA 
polymerase. These hosts were lysogens of bacteriophage DE3, a lambda derivative that 
carries the lad gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA 
polymerase was induced by addition of isopropyl-P-D-thiogalactoside (IPTG), and the T7 

20 RNA polymerase transcribes any target plasmid containing a functional T7 promoter, such as 
pET-28b, carrying its gene of interest. Strains include, for example, BL21(DE3) (Studier et 
aL, 1990 Meth. EnzymoL, 185: 60-89). 

To express the recombinant HBM sequence, 50 ng of plasmid DNA are isolated as 
described above to transform competent BL21(DE3) bacteria as described above (provided 

25 by Novagen as part of the pET expression kit). The lacZ gene (p-galactosidase) is expressed 
in the pET-System as described for the HBM recombinant constructions. Transformed cells 
were cultured in SOC medium for 1 hour, and the culture was then plated on LB plates 
containing 25 jig/ml kanamycin sulfate. The following day, the bacterial colonies were 
pooled and grown in LB medium containing kanamycin sulfate (25 |ig/ml) to an optical 
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density at 600 nM of 0.5 to 1 .0 O.D. units, at which point 1 mM IPTG was added to the 
culture for 3 hours to induce gene expression of the HBM recombinant DNA constructions. 

After induction of gene expression with IPTG, bacteria were collected by 
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets were 
5 resuspended in 50 ml of cold mM Tris-HCl, pH 8.0, 0. 1 M NaCl and 0. 1 mM EDTA (STE 
buffer). Cells were then centrifuged at 2000 x g for 20 minutes at 4°C. Wet pellets were 
weighed and frozen at -80 °C until ready for protein purification. 

19.1 Chinese Hamster Ovary (CHO) Expr ession System 
10 Alternatively, HBM and Zmaxl (LRP5) may be expressed in eukaryotic cells. 

Eukaryotic cells, such as mammalian derived cell lines, are more capable of expressing 
properly folded proteins containing cystine rich domains such as the EGF and LDLR 
modules. 

15 19.2 Develop ment of Crmstrnr.ts 

HBM and Zmaxl (LRP5) extracellular domain fusions (ECD) to IgG-Fc were 
prepared. These ECD fusions to the IgG-Fc domain remove the endogenous transmembrane 
and cytoplasmic portion of the Zmaxl/HBM receptor and should produce a secreted fusion 
protein. The Fc region is separated from the Zmaxl/HBM ECD by an enterokinase 

20 recognition site so that purified Zmaxl or HBM ECD protein can be obtained without the Fc 
domain present. The vector used for this construct was pHTop, a derivative of pED 
(Kaufinan et al 9 1991 Nua Acids Res. 19: 4485-90) in which the majority of the adenomajor 
late promoter was replaced by six repeats of the tet operator (Gossen et aL y 1992 Proc. Natl 
Acad. Set USA 89: 5547-51). This vector contains the dihydrofolate reductase (dhfr) gene, 

25 and when introduced in the cell line CHO/A2 (see description below), functions very 

efficiently. Clones with high expression can be selected by isolating cells which survive in 
high methotrexate (MTX) concentrations. 

The CHO expression vector pHTOP-Fc was digested with Sa/I and Noil. The 
intervening sequence was purified away from the rest of the vector by electroelution from an 
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acrylamide gel slice. Sail cuts 5' to the intrinsic honey bee mellitin signal sequence in 
pHTOP-Fc, and Notl cuts just 5 ! to the coding sequence IgGl-Fc. The resulting Sall-Notl 
pHTOP-Fc vector has the signal sequence removed and the Notl cloning site is amenable to 
creating a 5* fusion to IgG-Fc. Full-length Zmaxl (LRP5) in pCMVSPORT6 and full-length 
5 HBM in pCMVSPORT6 were digested individually withXmal which cuts within the region 
of the ORF that encodes the signal sequence) and BaniHL (that cuts internally in the ORF) to 
generate a 2286 bp 5' fragment of Zmaxl and HBM. The mutation which distinguishes 
Zmaxl from HBM lies on this fragment. Separately, the Zmaxl DNA was digested with 
BaniHL and Sacl to isolate an 1800 bp 3' fragment which is common to both the Zmaxl and 
10 the HBM genes. Together, these two fragments constitute the coding sequence for the HBM 
and Zmaxl extracellular domains, less the coding sequence for the first 6 amino acids of the 
signal sequence and ending 18 amino acids prior to the end of the extracellular domain, 
which we estimated from Kyte-Doolittle plots to end at the sequence "SPAHSS" (SEQ ID 
NO:698). 

15 A synthetic duplex was designed to recreate the coding sequence of the Zmaxl/HBM 

signal sequence 5' of the native Xma\ site, which included the initiator methionine and Kozak 
sequence. This duplex was designed to contain Sail (5 f ) and Xmal (3 1 ) cohesive ends to adapt 
ends to adapt the gene fragments described above to the pHTOP-Fc vector. This synthetic 
duplex was constructed from two partially complementary oligonucleotides as given below: 

20 S'-TCGACCACCATGGAGGCAGCGCCGC-S' (SEQ ID NO:699) 

3'-GGTGGTACCTCCGTCGCGGCGGGCC-5 f (SEQ ID NO:700) 
A second synthetic duplex was designed to recreate the 3 f coding sequence from a native Sacl 
site to the estimated end of the extracellular domain following the serine in the sequence ". . 
.SPAHSS", and to also encode a cloning site to allow in-frame fusion to the downstream IgG- 

25 Fc. This duplex was designed to contain Sacl (5 1 ) and Notl (3*) cohesive ends to adapt the 
gene fragments described above to the pHTOP-Fc vector. This synthetic duplex was 
constructed from two partially complementary oligonucleotides whose sequences are given 
below: 

S'-CATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAG 
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CAGTGGC-3' (SEQ ID NO:701) 

3-TCGAGTACACACTTTAGTGGTTCGGCGGGAGTCTGCTGTC GGGCCGGGT 
GTCGTCACCGCCGG-5' (SEQ ID NO:702) 

The fragments, synthetic duplexes, and vector were ligated together in a single 
reaction. Separate reactions were performed for Zmax (LRP5) 1 and HBM. The ligation 
mixtures were used to transform electrocompetent & coli DH10B cells, and the resulting 
colonies were screened by radioactive colony hybridization using the common Sacl-BamYR 3' 
fragment as a probe. Colonies containing plasmids with the Zmaxl or HBM fragment 
inserted were identified, and plasmids were isolated from multiple candidates and their 
sequences were verified by DNA sequencing. Verified constructs were then used for 
transfection into CHO cells. 



19.2.1 Establishment of CHO stab le cell lines 

The CHO/A2 cell line is derived from CHO DUKX Bl 1 (Uriaub et aL 9 1980 Proa 
Natl Acad. ScL USA 77: 4216-20) by stably integrating a transcriptional activator (tTA), a 
fusion protein, between the Tet repressor and the herpes virus VP 16 transcriptional domain 
(Gossen et al) CHO cell lines expressing extracellular HBM-l.Fc and Zmaxl. Fc were 
established by transfecting (using lipofection) pHTopHBM-l.Fc into CHO/A2 cells and 
pHTopZmaxl .Fc into CHO/A2 cells. Clones were selected using by culturing the cells in 
0.02 fiM methotrexate. Clones were later amplified step-wise to a final concentration of 0.5 
liM methotrexate. 



19.2.2 Screening of CHO stable re.11 linRg 

Multiple clones were screened by a variety of techniques. Clones were screened by 
Western blot assay using a (mouse) anti-human IgG.Fc horseradish peroxidase (HRP) 
antibody. The same clones were also metabolically labeled with 35 S-Met/Cys) for a 6 hour 
pulse, or a 15 minute pulse, followed by a 1 hour, 4 hour, or 24 hour chase in media without 
radiolabeled Met/Cys. Immunoprecipitations were performed on proteins obtained from 
conditioned media, as well as from cell extracts. Purification is then performed followed by 
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sequencing of the proteins using N-terminal sequencing as known in the art. 

19.2.3 Fusion Protein Purification 

Zmaxl-IgG or HBM-IgG fusion protein can be purified from conditioned media or 
cell extracts of CHO stable cells. The fusion protein is isolated by affinity binding to protein 
5 A (for example using protein A coated beads or columns). The IgG-FC domain can then 
subsequently be cleaved from the Zmax/HBMl ECD protein by enterokinase digestion. 

19.2.4 Potential uses for cell lines and p rotein 

Stable cell lines may be used for generation of purified protein for use in ligand 
10 hunting, antibody generation, determination of crystal structure, and competitive binding 
assays. 

A variety of methodologies known in the art can be used to purify the isolated 
proteins (Coligan et al, Current Protocols in Protein Science, John Wiley & Sons (1995)). 
For example, the frozen cells can be thawed, resuspended in buffer and ruptured by several 
15 passages through a small volume microfluidizer (Model M-l 10S, Microfluidics International 
Corp., Newton, MA). The resultant homogenate is centrifuged to yield a clear supernatant 
(crude extract) and, following filtration, the crude extract is fractioned over columns. 
Fractions are monitored by absorbance at OD 2g0 nm and peak fractions may be analyzed by 
SDS-PAGE. 

20 The concentrations of purified protein preparations are quantified 

spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, 1986 Eur. J. Biochem. 157:169-180). Protein concentrations are also measured by 
the method of Bradford, 1976 Anal Biochem. 72: 248-54 and Lowry et al 9 1951 «/ Biol. 
Chem. 193: 265-275 using bovine serum albumin as a standard. 

25 SDS-polyacrylamide gels of various concentrations were purchased from BioRad 

(Hercules, CA), and stained with Coomassie blue. Molecular weight markers may include 
rabbit skeletal muscle myosin (200 kDa), E. coli P-galactosidase (116 kDa), rabbit muscle 
phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine 
carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 
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kDa) and bovine aprotinin (6.5 kDa). 

Once a sufficient quantity of the desired protein has been obtained, it may be used for 
various purposes. A typical use is the production of antibodies specific for binding. These 
antibodies may be either polyclonal or monoclonal, and may be produced by in vitro or in 
5 vivo techniques well known in the art. Monoclonal antibodies to epitopes of any of the 
peptides identified and isolated as described can be prepared from murine hybridomas 
(Kohler, 1975 Nature, 256: 495). In summary, a mouse is inoculated with a few micrograms 
ofHBM protein over a period of two weeks. The mouse is then sacrificed. The cells that 
produce antibodies are then removed from the mouse's spleen. The spleen cells are then 

10 fused with polyethylene glycol with mouse myeloma cells. The successfully fused cells are 
diluted in a microliter plate and growth of the culture is continued. The amount of antibody 
per well is measured by immunoassay methods such as ELISA (Engvall, 1980 Meth. 
Enzymol. 70: 419). Clones producing antibody can be expanded and further propagated to 
produce HBM antibodies. Other suitable techniques involve in vitro exposure of 

15 lymphocytes to the antigenic polypeptides, or alternatively, to selection of libraries of 

antibodies in phage or similar vectors. See Huse a/., 1989 Science 246: 1275-81. For 
additional information on antibody production see Davis et aL, Basic Methods in Molecular 
Biology, Elsevier, NY, Section 21-2 (1989). 

Additional uses for purified or isolated protein includes use in X-ray crystallography, 

20 binding assays, and so forth as described in greater detail infra. 

19.3 Zmaxl, LRP6, and Variant Antibodies 

Polyclonal antibodies were developed to both human Zmaxl (LRP5) (SEQ ID NO:3) 
and LKP6 (GenBank Accession No. AF074264). Antibodies can similarly be prepared 
25 against HBM and HBM like proteins and polypeptides. Peptides from the Zmaxl amino acid 
sequence were selected as immunogens based on five goals. 1) Maximize differences 
between Zmaxl and LRP6 amino acid sequences (71% amino acid identity). See Fig. 27. 
For sequence comparison, typically one sequence acts as a reference sequence, to which test 
sequences are compared. When using a sequence comparison algorithm, test and reference 

124 

BNSDOCID: <WO 02092000A2_I_> 



WO 02/092000 PCT/US02/14877 



sequences are input into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. The sequence comparison 
algorithm then calculates the percent sequence identity for the test sequence(s), relative to the 
reference sequence, based on the designated program parameters. 2) Minimize potential 
5 cross reactivity with other known genes by performing sequence alignment and similarity 
searches. Optimal alignment of sequences for comparison can be conducted, e.g., by the 
local homology algorithm of Smith et aL 9 1981 Adv. Appl. Math. 2: 482, by the homology 
alignment algorithm of Needleman et al. 9 1970 J. Mol. Biol. 48: 443, by the search for 
similarity method of Pearson et aL, 1988 Proc. Natl. Acad. Set USA 85: 2444, by 

10 computerized implementations of these algorithms and others in programs contained in the 
Wisconsin genetics software package, Genetics Computer Group, 585 Science Dr., Madison, 
WI, or by visual inspection (see generally Ausubel et aL, Current Protocols in Molecular 
Biology, John Wiley & Sons (1997). Another example of algorithm that is suitable for 
determining percent sequence identity and sequence similarity is the BLAST algorithm, 

15 which is described in AltschvletaL, 1990 J. Mol. Biol. 215: 403-10. Softwarefor 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. 3) Obtain peptides with the highest antigenicity index as possible 
as determined by PeptideStructure protein analysis using software programs contained in the 
Wisconsin Genetics software package, Genetics Computer Group, 575 Science Dr., Madison, 

20 WI. 4) Locating peptides relative to highly homologous domains {e.g., EGF-like domains 

and LDL receptor repeats) within the gene family and the location relative to the extracellular 
and cytoplasmic regions of the gene. 5) And, for human Zmaxl specific antibodies, the 
human amino acid sequence (SEQ ID NO: 3) was compared to the mouse Zmaxl sequence 
(GenBank Accession No. AF064984) and peptides were selected based on the above criteria 

25 in addition to minimizing the sequence similarity between the two species (See Figure 26). 

Using the same criteria above, LRP6 specific peptides were selected for polyclonal 
antibody production. Table 1 1 lists the amino acid sequences that were chosen, the amino 
acid differences within the peptide between the human and mouse sequences. All the peptide 
sequences (ranging from 12-18 amino acids) were provided to Sigma/Genosys (St. Louis, 
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MO) for peptide synthesis and subsequent polyclonal antibody production in New Zealand 
White Rabbits. The IgG fraction from the serum of each immunized rabbit was isolated 
using Protein G Sepharose (Amersham). Polyclonal antibody generation using these peptides 
may be done in other species as well, for example, chickens. This is often advantageous 
5 when there is a high degree of similarity between the human (reference) and murine/rodent 
sequence. 



TABLE 11 



Amino 
Acids 


Amino Acids 


SEQ 


H/M* 
Differences 


Comments 


171-187 


VETPRTERAGMDGSTRK 


703 


5 


Contains HBM 
polymorphism 


264-278 


NKRTGGKRKEILSAL 


704 


! 3 


Extracellular 


290-301 


ERQPFFHTRCEE 


705 


2 


Adjacent to EGF-I, 
extracellular 


532-546 


VDGTKRRTLLEDKLP 


706 


5 


Extracellular 


901-915 


DGLNDCMHNNGQCGQ 


707 


2 


In EGF-m, extracellular 


1010-1021 


PFVLTSLSQGQN 


708 


6 


Extracellular, human 
specific 


1415-1429 


YAGANGPFPHEYVSG 


709 


3 


Cytoplasmic 


1452-1464 


ACGKSMMSSVSLM 


710 


5 


Cytoplasmic, human 
specific 


1556-1573 


RWKASKYYLDLNSDSDPY 


711 


1 


Cytoplasmic 


888-902 


SGWNECASSNGHCSH 


712 




LRP6 specific 


1308-1321 


NGDANCQDKSDEKN 


713 




LRP6 specific 



* H/M-differences between human and mouse sequences 



Antibodies towards variants of LRP6, HBM and LRP5 are also contemplated. Based 
25 on the analysis of the structural model of the LRP5 beta propeller 1 (discussed in more detail 
in the Examples), interior regions of the propeller were analysed. Since the site-directed 
mutagenesis experiments had confirmed that modulation of propeller 1, particularly in the 
interior regions of beta propeller 1, could result in an HBM effect, a strategy was employed to 
generate antibodies with epitopes specific to these regions. Such antibodies to the wild type 
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LRP5 receptor could serve, for examples, as an HBM mimetic, by altering ligand/receptor 
interactions, protein-protein interactions, or by modulating Wnt or Dkk activity. Such 
antibodies could be used as therapies to treat, for example osteoporosis. 
One set of preferred antibodies includes: 



LRP5 Amino Acids 


Sequence 


208-223 


KLYWADAKLSFIHRAN 


277-291 


ALYSPMDIQVLSQER 


61-73 


GLEDAAAVDFQFSKGA 


234-247 


EGSLTHPFALTLSG 


249-264 


TLYWTDWQTRSIHACN 


144-156 


VLFWQDLDQPRAI 


194-210 


IYWPNGLTIDLEEQKLY 


34-47 


LLLFANRRDVRLVD 


75-89 


GAVYWTDVSEEAIKQ 


121-135 


KLYWTDSETNRIEVA 



Similar peptides can be used to prepare antibodies based on the LRP6 or HBM structures 
and the proposed mutations in their propellers. Other antibodies could easily be prepared 
based on the structural model discussed in the Examples, as would be readily appreciated 
by the skilled artisan. 

20 Another series of antibodies could be prepared which target the domains on HBM, 

LRP5, LRP6 and their variants which interact Dkk. Such antibodies could again serve as 
HBM mimetics, for example by displacing Dkk binding and thereby could be used as an 
osteoporosis therapeutic. Preferred peptides for preparing antibodies include but are not 
limited to: 

25 LRP5 Amino.Acids Sequence 

969-993 LILPLHGLRKVKAIDYDPLDKFIYW 
989-1013 KFIYWVDGRQNIKEAKDDGTQPFVL 
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LRPS Amino Acids Sequence 



10 



15 



20 



25 



1009-1033 QPFVLTSLSQGQNPDRQPHDLSEDI 

1029-1053 LSIDIYSRTLFWTCEATNTINVHRL 

10491073 NVHRLSGEAMGWLRGDRDKPRAIV 

1253-1266 CGEPPTCSPDQFAC 

1278-1295 WRCDGFPECDDQSDEEGC 

1316-1332 RCDGEADCQDRSDEADC 

1370-1383 CEITKPPSDDSPAH 

Additional peptides would be readily apparent to the artisan of ordinary skill. 

19.3.1 Sin gl e chain Fv molecules develop ed hy phage display 
Peptides were chosen from the Zmaxl (LRP5) sequence (SEQ ID NO: 3) to screen 
for single chain Fv (scFv) molecules by phage display. Similar peptides can be chosen for 
LRP6, HBM and HBM-like proteins to screen for scFv molecules. As discussed below, all 
mention of LRP5 is also meant to include these other proteins. 

A total of 17 peptides from the Zmaxl sequence were selected for synthesis and 
subsequent phage display screen for scFv molecules. All peptide synthesis and phage 
display work was performed at Cambridge Antibody Technology (CaT) in Cambridge, UK. 
Peptides were selected based on criteria as described above. 

TABLE 12 



Protein Domain 


Zmaxl Residues 


LRP6 Residues 


% Identity 


Spacer 1 (+G171V) 


161-181 


148-168 


76% 


Spacer 1 (-G171V) 


161-181 


148-168 


76% 


EGF 1 


301-321 


288-308 


76% 


Spacer 2 


401-421 


388-408 


52% 


EGF 2 


611-631 


598-618 


62% 


Spacer 3 


781-801 


768-788 


62% 
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Protein Domain 


Zmaxl Residues 


LRP6 Residues 


% Identity 


EGF3 


921-941 


908-928 


10% 


Spacer 4 


1000-1021 


988-1008 


26% 


EGF4 


1229-1249 


1219-1239 


76% 


LDLR 1 


1261-1282 


1252-1272 


81% 


LDLR 2 


1300-1320 


1290-1310 


57% 


LDLR 3 


1338-1358 


1328-1348 


48% 


Cytosolic 1 


1418-1438 


1405-1425 


14% 


Cytosolic 1 


1516-1536 


1503-1525 


52% 


Cytosolic 1 


1535-1555 


1524-1544 


81% 


Cytosolic 1 


1595-1615 


1592-1613 


82% 


Spacer 2 (cross reactive) 


421-441 


408-428 


100% 



Note that a number of these regions (e.g., 401-421, 421-441, 781-801, and 1229- 
1249) share 100% identity with mouse LRP5 (see Fig. 26). Therefore, these may be used 
1 5 against both mouse and human forms of the protein. The peptide 421-441 was included to 
facilitate the generation of an antibody that would recognize both Zmaxl (LRP5) and LRP6 
(see Figure 27). Two peptides were synthesized spanning the HBM mutation site (Zmaxl 
residues 161-181), one with the Zmaxl sequence and the other containing the HBM 
sequence. 

20 Once scFv molecules were isolated, they were used as reagents in immunochemistry 

to detect Zmaxl (LRP5) protein expression in a variety of human normal and diseased 
tissues. The details of the scFV antibody immunohistochemical analysis of three phage 
clones against peptide 1000-1021 (i.e., IKRAKDDGTQPFVLTSLSQGQN; SEQ ID NO: 
714) of the extracellular domain of Zmaxl showed positive staining with cardiac muscle, 

25 kidney, lung and liver. Expression was also detected in prostate carcinoma. These results 
are consistent with mRNA tissue distribution profiles as well as with the published reports 
of LRP5 mRNA localization (Kim et a/., 1998 Biochem. 124: 1072-6). The resulting 
phage clones arise from pools and will be sequenced to identify potential variants in the Fv 
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region of the molecules. Once identified, the suitable scFVs can then be subcloned into 
variable heavy chain and variable light chain DNA constructs for cotransfection into COS 
cells for final assembly of an intact and functional immunoglobulin gamma (IgG) molecule. 
The IgG that is expressed by the cells can then be further characterized for specificity and 
5 reactivity as would be known in the art. 

19.3.2 Monoclonal antibody develop ment 

Monoclonal antibodies can be developed to Zmaxl (LRP5), LRP6, HBM and HBM- 
like proteins/polypeptides by complete cell and adenovirus immunization of, for example, 

10 Balb/c mice (antibodies to all forms are contemplated even when only a few examples are 
discussed in detail). Dendritic cells can be isolated from spleens of Balb/c mice, for 
example, and the cells expanded in vitro in the presence of growth factors IL-4 and GM- 
CSF. The dendritic cells can then be infected with HBM or Zmaxl adenovirus particles. 
The cells are then cultured for 24 hours prior to intravenous injection into Balb/c mice. 

15 Dendritic cells (lxlO 6 cells/mouse) are injected 2-3 times every 3-4 weeks over a three 
month period. 

Alternatively, purified HBM and Zmax DNAs in, for example, the pcDNA3.1 
expression vector, can be coated on colloidal gold particles. These particles can then be 
injected subcutaneously into the desired mouse using gene gun technology. Approximately, 

20 5 yug cDNA/mouse can be injected. Injections are performed 4-6 times every 2 weeks over 
approximately a 3 month period. 

Another option is that cells (any species of animal, but preferably Balb/c mouse 
strain or the same species as the mouse strain being used which is related to limit antigen 
response to non-specific protein) over expressing HBM and Zmaxl (LRP5) and their 

25 respective adenovirus will be injected into the mice every 2-3 weeks for a period of about 
1.5 to 3 months, as necessary. The bleeds from the mice can be tested for reactivity with 
the native and denatured protein by ELISA (using purified protein or protein-fusions), cell 
based ELISA, immunohistochemical staining and Western blotting. Serum samples from 
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the animals can be screened by FACS (fluorescent activated cell sorting) using cells 
infected with Zmaxl or HBM adenovirus. The spleen cells (antibody producing cells) from 
the mice with the strongest reactivity can then be fused with a myeloma to generate the 
hybridoma cells. The conditioned media from the hybridoma is then screened for the 
5 positive cell colonies for subsequent cloning. These cloned cells can then be injected into 
the intraperitoneal space in mice for ascites production. 

19.3.3 Polyclonal Antibo dy A pp l ications 

Polyclonal antibodies directed against Zmaxl (LRP5) and LRP6 were developed to 
10 determine the function of these proteins, analyze the expressed pattern and levels in various 
tissues, cells or any biological sample. Similar antibodies could be prepared which 
distinguish the wild type forms from the HBM and HBM-like variants. 

Uses for polyclonal antibodies against Zmaxl, HBM, HBM-like polypeptides, and 
LRP6 include but are not limited to: analysis of bone cross-sectional mounts, tissue 
15 distribution, evaluation of expression of the protein from bone biopsy samples of 

affected/non-affected family members (e.g., bone cell digests, explants of bone marrow 
stromal cell cultures), evaluation of protein expression levels in transiently or stably 
transfected cells, evaluating protein concentration in tissues, serum or body fluid, 
purification of full length or fragments of these proteins for ligand hunting and functional 
20 assay development, identification of ligands or proteins which interact with these proteins, 
and elucidations of the signaling pathways of LRP6, Zmaxl (LRP5) and HBM, and related 
variants. 

For example, Zmaxl cloned in pcDNA3.1 (Invitrogen, Carlsbad, CA). This was 
used to generate 35 S-labeled in vitro translated (Promega, Madison, WI) Zmaxl. Antibody 
25 (10 jig/ml) 3109 and 3110, which are directed against peptide immunogen, 

RWKASKYYLDLNSDSDPY (SEQ ID NO:711), was combined with 20 \d of the in vitro 
translated product in the presence of either 10 jig/ml specific peptide (i.e., 
RWKASKYYLDLNSDSDPY; SEQ ID NO: 711) or non-specific peptide (i.e., 
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SGWNECASSNGHCSH; SEQ ID NO: 712) or no peptide and incubated for 1.5 hr at 4°C. 
Protein A Sepharose was then added to the samples (previously blocked for about 1.5 hr with 
reticulocyte lysate), and the samples were shaken for 1 hour at 4°C. The protein A Sepharose 
was washed 3 times with 0,5 ml of PBS. The bound protein was subsequently separated on a 
5 4-12% gradient NuPAGE gel (Invitrogen) according to manufacturer's instructions. The gel 
was dried at 80 °C for 30 min and then exposed to Kodak X-OMAT-AR film for 24 to 48 hr. 
The specific peptide was observed to significantly compete for the 35 S-labeled Zmaxl 
immunoprecipitated protein with either antibody. The competition was not observed with a 
non-specific peptide. 

10 These antibodies can also be used for immunohistochemistry. For example, HBM 

transgenic and wild-type mice were sacrificed using C0 2 narcosis. Mouse calvariae were 
removed intact, and the soft tissues gently dissected. The bones were fixed in 10% phosphate 
buffered formalin for 24 hours for further processing and analysis. After fixation, calvariae 
were decalcified in TBD-2 decalcifying agent (Shandon, Pittsburgh, PA) for about 7-8 hours 

15 and then dehydrated in graded alcohol. Calvariae were then bisected perpendicular to the 

sagittal suture through the central portion of the parietal bones parallel to the lambdoidal and 
coronal sutures and embedded in paraffin. Four to six 5 fxra thick representative sections 
were cut. 

For example, the rabbit polyclonal antibody, Zmaxl/HBM (i.e., antibody 3109 and 
20 3110) recognize Zmaxl (LRP5) in both HBM transgenic and wild-type mouse calvariae. An 
anti-Zmaxl or anti-HBM antibody can be used to detect Zmaxl or HBM protein in order to 
evaluate its abundance and pattern of protein expression. Detection can be facilitated by 
coupling (i.e., physically linking) the antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic groups, fluorescent materials, 
25 luminescent materials, bioluminescent materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, alkaline phosphatase, P-galactosidase, or 
acetylcholinesterase; example of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
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fluorescein, dansyl chloride or phycoerythrin; and example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin and acquorien; 
and examples of suitable radioactive material include I25 1, 131 I, 35 S, and 3 H. Alternatively, a 
secondary antibody can be employed that detects the presence of the primary Zmaxl 
5 polyclonal antibody. An example would be an antibody that recognized all rabbit 

immunoglobulins. This secondary antibody could be coupled in an identical manner as 
described above to facilitate detection. Controls comprised samples with the avidin 
peroxidase, but without antibody. Intensive positive staining of stroma cells and 
mesenchymal cells was observed in the suture area. Pre-osteoblasts and osteoblasts were 

10 observed to stain within the periosteum and some osteocytes with antibody 3109 and 31 10 in 
the calvariae of the HBM compared to wild-type mice. High magnification of tissue calvaria 
sections of the HBM transgenic mice showed a pronounced cell membrane staining of the 
osteocytes and the cells within the suture area. 

Similar antibodies could be prepared to HBM-like proteins and polypeptides, as 

15 would be readily appreciated by the artisan of ordinary skill. 

20. Methods of Use: Oene Therapy 

In recent years, significant technological advances have been made in the area of gene 
therapy for both genetic and acquired diseases (Kay et al, 1997 Proc. Natl Acad. Set USA, 

20 94: 12744-6). Gene therapy can be characterized as the deliberate transfer of DNA for 

therapeutic purposes. Improvement in gene transfer methods has allowed for development of 
gene therapy protocols for the treatment of diverse types of diseases. Gene therapy has also 
taken advantage of recent advances in the identification of new therapeutic genes, 
improvement in both viral and nonviral gene delivery systems, better understanding of gene 

25 regulation, and improvement in cell isolation and transplantation. 

The preceding experiments identify the HBM gene as a dominant mutation conferring 
elevated bone mass. Additional HBM-like genes are identifiable based on the model and data 
presented herein regarding the propellers and structure of the HBM protein. The fact that this 
mutation is dominant indicates that expression of the HBM protein causes elevated bone 

133 

BNSDOCID: <WO 02092000A2J_> 



WO 02/092000 PCT/US02/14877 



mass. Older individuals carrying the HBM gene, and, therefore expressing the HBM protein, 
do not suffer from osteoporosis. These individuals are equivalent to individuals being treated 
with the HBM protein. These observations are a strong experimental indication that 
therapeutic treatment with the HBM protein prevents osteoporosis. The bone mass elevating 
5 activity of the HBM gene is termed "HBM function 1 ' or "HBM like phenotype" in the instance 
of other HBM variants. 

Therefore, according to the present invention, a method is also provided of supplying 
HBM function to mesenchymal stem cells (Onyia et al, 1998 J. Bone Miner. Res. 13: 20-30; 
Ko et aL, 1996 Cancer Res. 56: 4614-9). Supplying such a function provides protection 
10 against osteoporosis. The HBM gene or a part of the gene or other iZBM-like gene may be 
introduced into the cell in a vector such that the gene remains extrachromosomal. In such a 
situation, the gene will be expressed by the cell from the extrachromosomal location. 

Vectors for introduction of genes both for recombination and for extrachromosomal 
maintenance are known in the art, and any suitable vector may be used. Methods for 
15 introducing DNA into cells such as electroporation, calcium phosphate co-precipitation, and 
viral transduction are known in the art, and the choice of method is within the competence of 
one skilled in the art (Robbins, ed., Gene Therapy Protocols , Human Press, NJ (1997)). Cells 
transformed with the HBM gene (or HBM-like gene) can be used as model systems to study 
osteoporosis and drug treatments that promote bone growth. 
20 As generally discussed above, the HBM or HBM-like gene or biologically active 

fragment thereof, where applicable, may be used in gene therapy methods in order to increase 
the amount of the expression products of such genes in mesenchymal stem cells (in all 
instances where discussing HBM gene and its cognate product, the HBM-like gene and 
cognate protein is also contemplated). It may be useful also to increase the level of 
25 expression of a given HBM protein, or a fragment thereof, even in those cells in which the 
wild type gene is expressed normally. Gene therapy would be carried out according to 
generally accepted methods as described by, for example, Friedman, Therapy for Generic 
Diseases, Friedman, Ed., Oxford University Press, pages 105-121 (1991). 

A virus or plasmid vector containing a copy of the HBM gene linked to expression 
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control elements and capable of replicating inside mesenchymal stem cells, is prepared. 
Suitable vectors are known and described, for example, in U.S. Patent No. 5,252,479 and WO 
93/07282, the disclosures of which are incorporated by reference herein in their entirety. The 
vector is then injected into the patient, either locally into the bone marrow or systemically (in 
5 order to reach any mesenchymal stem cells located at other sites, i.e., in the blood). If the 
transfected gene is not permanently incorporated into the genome of each of the targeted 
cells, the treatment may have to be repeated periodically. 

Gene transfer systems known in the art may be useful in the practice of the gene 
therapy methods of the present invention. These include viral and non-viral transfer methods. 

10 A number of viruses have been used as gene transfer vectors, including polyoma, i.e., SV40 
(Madzakefa/., 1992/. Gen. Virol 73: 1533-6), adenovirus (Berkner, 1992 Curr. Top. 
Microbiol Immunol 158: 39-61; Berkner et al, 1988 BioTechniques, 6: 616-629; Gorziglia 
et al, 1992 J. Virol 66: 4407-12; Quantin et al, 1992 Proc. Natl Acad. Sci. USA 89: 2581- 
2584; Rosenfeld et al, 1992 Cell 68: 143-155; Wilkinson et al, 1992 Nucl. Acids Res. 20: 

15 2233-39; Stratford-Perricaudet et al, 1990 Hum. Gene Ther. 1: 241-256), vaccinia virus 

(Mackett et al, 1992 Biotechnology 24: 495-499), adeno-associated virus (Muzyczka, 1992 
Curr. Top. Microbiol Immunol 158: 91-123; Ohi et al, 1990 Gene 89: 279-282), herpes 
viruses including HSV and EBV (Margolskee, 1992 Curr. Top. Microbiol Immunol 158: 67- 
90; Johnson et al, 1992 J. Virol, 66: 2952-65; Fink et al, 1992 Hum. Gene Titer. 3: 11-9; 

20 Breakfield et al, 1987 Mol Neurobiol 1 : 337-371 ; Fresse et al, 1990 Biochem. Pharmacol, 
40: 2189-2199), and retroviruses of avian (Brandyopadhyay et al, 1984 Mol Cell Biol, 4: 
749-754; Petropouplos et al, 1992 J. Virol 66: 3391-3397), murine (Miller, 1992 Curr. Top. 
Microbiol Immunol 158: 1-24; Miller et al, 1985 Mol Cell Biol 5: 431-437; Sorge et al, 
1984Mol CellBiol.A: 1730-7; Maxmetal, 1985 J. Virol 54: 401-7), and human origin 

25 (Page et al, 1990 J. Virol 64: 5370-6; Buchschalcher et al, 1992 J. Virol 66: 2731-9). Most 
human gene therapy protocols have been based on disabled murine retroviruses. 

Non- viral gene transfer methods known in the art include chemical techniques such as 
calcium phosphate coprecipitation (Graham et al, 1973 Virology 52: 456-67; Pellicer et al, 
1980 Science 209: 1414-22), mechanical techniques, for example microinjection (Anderson 

135 



BNSDOCIO: <WO 02092000A2J_> 



WO 02/092000 PCT/US02/14877 



et al, 1980 Proc. Natl Acad. Set USA, 77: 5399-5403; Gordon et al, 1980 Proc. Natl. Acad. 
Set USA, 77: 7380-4; Brinster et al, 1981 Cell 27: 223-231; Constantini et al, 1981 Nature 
294: 92-94), membrane fusion-mediated transfer via liposomes (Feigner et al, 1987 Proc. 
Natl Acad. Set USA 84: 7413-7; Wang et al, 1989 Biochemistry 28: 9508-14; Kaneda et al, 
5 1989 J. Biol Chem. 264: 12126-9; Stewart et al, 1992 Hum. Gene Tlier. 3: 267-275; Nabel et 
al, 1990 Science 249: 1285-8; Lim et al, 1992 Circulation 83: 2007-1 1), and direct DNA 
uptake and receptor-mediated DNA transfer (Wolffs al, 1990 Science 247: 1465-8; Wu et 
al, 1991 BioTechniques 11: 474-85; Zenke et al, 1990 Proc. Natl Acad. Set USA 87: 3655- 
9; Wu e* al, 1989 J. C/zem. 264: 16985-7; Wolffs al, 1991 BioTechniques 11: 474-45; 

10 Wagner et al, 1990; Wagner e* al, 1991 Proc. iVatf. ,4catf. Set USA, 88: 4255-9; Cotten et 
al, 1990 Proc. Afatf. Acad. Set USA, 87: 4033-7; Curiel et al, 1991 Proc. Natl Acad. Set 
USA, 88: 8850-4; Curiel et al, 1991 Hum. Gene Ther. 3: 147-54). Viral-mediated gene 
transfer can be combined with direct in vivo vectors to the mesenchymal stem cells and not 
into the surrounding cells (Romano et al, 1998 In Vivo, 12: 59-67; Gonez et al, 1998 Hum. 

15 Mol Genetics 7:1913-9). Alternatively, the retroviral vector producer cell line can be 
injected into the bone marrow (Culver et al, 1992 Science 256: 1550-2). Injection of 
producer cells would then provide a continuous source of vector particles. This technique has 
been approved for use in humans with inoperable brain tumors. 

In an approach which combines biological and physical gene transfer methods, 

20 plasmid DNA of any size is combined with a polylysine-conjugated antibody specific to the 
adenovirus hexon protein, and the resulting complex is bound to an adenovirus vector. The 
trimolecular complex is then used to infect cells. The adenovirus vector permits efficient 
binding, internalization, and degradation of the endosome before the coupled DNA is 
damaged. 

25 Liposome/DNA complexes have been shown to be capable of mediating direct in vivo 

gene transfer. While in standard liposome preparations the gene transfer process is non- 
specific, localized in vivo uptake and expression have been reported in tumor deposits, for 
example, following direct in situ administration (Nabel, Hum. Gene Ther., 3:399-410 (1992)). 
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21. Methods of Use: Transformed Hosts and Transgenic Animals as Research Tools 
and for the Development of Pharmaceuticals 

Cells and animals that carry the HBM, HBM-like, Zmaxl (LRP5), or LRP6 genes, 
used as model systems, are valuable research tools to study and test for substances that have 
5 potential as therapeutic agents (Onyia et aL, 1998 J. Bone Miner. Res., 13: 20-30; Broder et 
aL, 1997 Bone, 21: 225-35). Discussion of one of these is meant to include the others, e.g., 
discussion of HBM is meant to include HBM-like variants, and so forth. 

Cells for this purpose are typically cultured mesenchymal stem cells. These may be 
isolated from individuals with somatic or germline HBM genes. Alternatively, the cell line 

10 can be engineered to carry the HBM or HBM-like gene, as described above. After a test 

substance is applied to the cells, the transformed phenotype of the cell is determined. Any 
trait of transformed cells can be assessed, including, for example, formation of bone matrix in 
culture (Broder et aL, 1997 Bone, 21: 225-235), mechanical properties (Kizer et aL, 1997 
Proc. Natl. Acad. Sci. USA 94: 1013-8), expression of marker genes and response to 

15 application of putative therapeutic agents. 

Transgenic modified animals and cell lines may be used to test therapeutic agents. 
Transgenic modifications include, for example, insertion of the Zmaxl (LRP5) gene, LRP6, 
HBM gene or HBM-like gene and disrupted homologous genes. Alternatively, the inserted 
Zmaxl, LRP6, HBM, and/or HBM-like gene of the animals may be disrupted by insertion or 

20 deletion mutation of other genetic alterations using conventional techniques, such as those 
described by, for example, Capecchi, 1989 Science 244: 1288; Valancuis et aL, 1991 MoL 
CellBioL 11: 1402; Hasty et aL, 1991 Nature 350: 243; Shinkai et aL, Cell, 68:855 (1992); 
Mombaerts et aL, Cell, 68:869 (1992); Philpott et aL, 1992 Science 256: 1448; Snouwaert et 
aL, 1992 Science 257: 1083; Donehower et aL, 1992 Nature 356: 215. After test substances 

25 have been administered to the animals, the growth of bone must be assessed. If the test 
substance modulates (e.g., enhances) the growth of bone, then the test substance is a 
candidate therapeutic agent. These animal models provide an extremely important vehicle for 
potential therapeutic products. 

The present invention also provides animals and cell lines wherein the expression of 
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endogenous genes are activated, and may be further amplified, which does not require in viti-o 
manipulation and transfection of exogenous DNA encoding Zmaxl (LRP5) or HBM proteins. 
These methods are described for example in PCT Application WO 94/12650 and U.S. Patent 
No. 5,968,502, both of which are herein incorporated in their entirety by reference. In 
5 addition, the present invention includes methods wherein endogenous activation or over- 
expression is achieved by in situ homologous recombination, non-homologous 
recombination, or illegitimate recombination methods. These methods are described for 
example in PCT Applications WO 99/15659 and WO 00/49162, both of which are 
incorporated herein in their entirety. 

10 

21.1 Crea tin g transgenic and gene-targeted animals 

The present invention provides genetically modified animals that recapitulate the 
human HBM or a HBM-like phenotype. The approaches taken involve the creation of both 
transgenic and gene-targeted animals that have the human G to T nucleotide substitution 
15 incorporated into the genome, express human Zmaxl (LRP5) or express a variant with a bone 
mass altering phenotype. These approaches can be used with any gene, such as HBM, HBM- 
like, LRP5 and LRP6 systems. 

21.1.1 Transgenic mice over-expressing the HBM polymorp hism 
20 Plasmid constructs were prepared that utilized either the CMVbActin or type I 

collagen promoters to drive expression of the human HBM cDNA. The most 
commonly-used promoters for mammalian expression are from cytomegalovirus (CMV), 
Rous sarcoma virus (RSV), Simian virus 40 (SV40), and EF-la (human elongation factor la- 
subunit). CMV is derived from the human cytomegalovirus immediate-early viral promoter. 
25 CMV is a stronger promoter in most cell lines than either RSV or SV40. The RSV promoter 
is derived from an avian virus and tends to be a strong promoter in avian cell lines. The 
SV40 promoter expresses well in cell lines that carry the large T antigen, such as COS-1. In 
these cell lines, the plasmid is replicated to higher copy numbers. EF-la is beginning to be 
more widely used for recombinant protein expression. EF-la is the promoter from the human 
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elongation factor la-subunit, a gene that is highly expressed and conserved in mammalian 
cell lines. 

The chimeric CMVbActin promoter is a strong promoter that has been shown to 
produce ubiquitous gene expression in transgenic mice including bone. This promoter was 
5 chosen to drive expression of HBM in a manner consistent with the reported widespread 

expression of the endogenous mouse Zmaxl (LRP5) gene. Although the HBM phenotype is 
observed in bone, the HBM gene may have direct or indirect effects in other tissues. 
Therefore, other strong ubiquitous promoters may be utilized as would be known to those 
skilled in the art. 

10 Type I collagen promoters provide tissue-restricted gene expression wherein 

expression is primarily limited to bone. Other bone-specific promoters are available that 
could result in expression of HBM in bone. For example, promoters associated with 
proliferation of osteoblasts include histone, type I collagen, TGFJ31, MSX2, cfos/cJun and 
Cbfal may be used. Promoters associated with bone matrix maturation including alkaline 

15 phosphatase, MGP, Cbfal, Fra/Jun and Dlx5 also can be used. Promoters associated with 

bone mineralization such as osteocalcin, osteopontin, bone sialoprotein and collagenase also 
can be used. The promoter chosen would be determined by, for example, the tissue 
expression, the degree of regulatable control and the like as would be known to the skilled 
artisan. For example, the type I collagen promoters were chosen to insure that HBM would 

20 be expressed in bone in a temporal, spatial and bone cell-specific pattern resembling the 
endogenous pattern of Zmaxl (LRP5) expression in bone. 

21.1.2 Transgenic mice over-expressing th e w ild-type (LRP5) Zmaxl g ene 
Plasmid constructs were prepared using the CMVbActin and type I collagen 
25 promoters driving expression of Zmaxl (LRP5). These animals can serve as a control animal 
model for the HBM, HBM-like and LRP6 transgenic mice (all of which are contemplated 
when discussing LRP5/Zmaxl). Additional controls include non-transgenic littermates and 
wild-type animals of an identical genetic background. Methods for preparing these animals 
would be similar to what is discussed for mice which over express the HBM polymorphism. 
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21.1.3 LRP6 gene targeted knock out mine 

LRP6 knock-out mice were generated using Omnibank embryonic stem (ES) cells 
carrying a gene trap vector which inserted into the first intron of the LRP6 gene. The insert 
location was determined to be the LRP6 gene by an Omnibank Sequence Tag (OST) 
5 generated by reverse transcription PCR (RT-PCR) of a fusion transcript comprised of 5' gene 
trap vector sequence spliced to the host gene transcript 3* of the insertion site. The gene trap 
vector functionally knocks out the mouse LRP6 gene by forced spicing of LRP6 exon 1 to the 
IRES-LacZ-PolyA element of the gene trap, preventing transcription of LRP. 

Chimeric mice were generated with ES cells, identified as OST38808, by injection 

10 into C57BL/6 albino host blastocysts which were then transferred to pseudopregnant females 
and allowed to develop through birth. Genu line chimeras were backcrossed to 129SvEVBrd 
strain mice to maintain the knockout allele of LRP6 on an inbred 1298SvEvBrd genetic 
background. Germ line transmission of the LRP6-KO allele was identified by PCR 
amplification of a gene trap specific sequence. Heterozygous LRP6-KO mating pairs were 

1 5 used for continued breeding. The genotype of wt and LRP6-KO progeny is determined by tail 
DNAPCR. 

Measurements of bone density at 9 weeks of age in female heterozygous knock-out 
mice has shown significant (p<0.05) decreases in bone volume, trabecular number, and 
trabecular thickness as measured by uCT. These results are consistent with the hypothesis 
20 that LRP6 is also involved in modulating bone density and is a target for development of 
therapies and drugs. Accordingly, LRP6 transgenic animals and transgenic animals 
expressing bone modulating variants of LRP6 are contemplated within the scope of the 
invention. 

25 21.1.4 Gene-targeted mice expressing the HRM pol ymorphism 

A gene-targeting construct was prepared that could be used to create animals 
containing a HBM or HBM-like knock-in (KI) allele and a Zmaxl (LRP5) knock-out (KO) 
allele. The gene-targeting construct contained the HBM polymorphism in exon 3 and 
included a neomycin selection cassette that was linked to a transcriptional stop sequence and 
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was flanked with lox P sites. The HBM polymorphism in mouse Zmaxl results in a glycine 
to valine change in the amino acid sequence at position 170 of the mouse LRP5 homolog 
(Genbank Accession No. AF064984). Homologous recombination is used to stably introduce 
the construct into the mouse genome. If the transcriptional stop sequence functioned to 
5 completely block transcription of the modified Zmaxl allele, then a functional Zmaxl knock- 
out allele would be generated. This would facilitate production of a homozygous knock-out 
animal for the Zmaxl gene. 

To create the knock-in allele, Cre recombinase could be used to excise the neomycin 
selection cassette leaving behind the modified exon 3 and one copy of the loxP site. Cre 

10 could be introduced into single-cell fertilized embryos to facilitate ubiquitous expression of 
HBM or by crossing animals with transgenic mice to obtain bone-specific HBM expression. 
Homozygous animals could be made for the HBM knock-in allele. Alternatively, animals 
could be created by nuclear transfer techniques, wherein nuclei from homozygous animals is 
transferred into a prepared oocyte (e.g., enucleated) as is known in the art. See, e.g., 

15 Campbell et al.. 9 Nature 380: 64-68 (1996). Additional methods of creating knock out mice, 
include engineering a homologous recombination vector wherein the ATG start codon is 
deleted or mutated, engineering a frame-shift mutation into the vector, engineering deletions 
of critical portions of the promoter region, and/ or engineering a vector to delete critical 
regions of the gene. 

20 The methods used for the LRP5, LRP6 and HBM mice can similarly be used for other 

variants of these genes. 

21.2 Materials and Methods 

21.2.1 Construction of the Zmaxl (LRP5) plasmid Zmaxl GQ AS 
25 The full-length Zmaxl cDNA construct has been engineered into the Xbal-Notl sites 

of the pCMVSPORT6.0 vector from Life Technologies (part of the Gateway cloning system) 
to create ZmaxlGI_3AS. The insert (5,278 bp) can be released from the vector by digestion 
with either HindUl orXbal on the 5 f side together with either EcdRV or EcoBl on the 3' end. 
A similar construct can be prepared for LRP6 and HBM-like nucleic acids as would be 
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apparent to one of ordinary skill and is contemplated as well in the discussion of this section. 

The Zmaxl (LRP5) construct was generated from four independent partial clones. 
These clones were isolated from a Zmaxl specific primed cDNA library. A partial Zmaxl 
cDNA clone existed in the internal survey sequencing clone set as L236BJP0049E08 isolated 
5 from an oligo-dT primed HeLa cell cDNA library. This clone was truncated at the 5 -primed 
end. In order to isolate more 5-prime containing fragments necessary to generate a full length 
cDNA, a Zmaxl gene-specific cDNA library was generated from Clontech human liver poly- 
A mRNA (catalog #6510-1, lot# 9060032) and Life Technologies Superscript® Plasmid 
System for cDNA Synthesis and Plasmid Cloning kit (catalog no. 18248-013). This library 

10 was designated as L401 . The manufacturer's protocol was carried out with the following 

modifications. l)In both first and second strand synthesis reactions, DEPC-treated water was 
substituted for <x- 32 P-dCTP. 2)Reverse transcription was primed using oligonucleotides that 
were selected to be specific for the Zmaxl gene at approximately 1 kb intervals. These 
sequences were checked using the program BLAST against the public databases to ensure 

15 Zmaxl specificity. 3)Two separate reverse transcription reactions were performed. The first 
reaction, (A), was primed with oligonucleotides which annealed to the more 3' regions of 
Zmaxl (LRP5) as follows: 

471 14: 5'-CGTACGTAAAGCGGCCGCTTGGCAATACAGATGTGGGA-3' (SEQ ID NO: 715) 

471 16: 5'-CGTACGTAAAGCGGCCGCAGTAGCTCCTCTCGGTGGC-3' (SEQ ID NO: 716) 

20 471 18: S^CGTACGTAAAGCGGCCGCGCTCATCATGGACTTTCCG-S' (SEQ ID NO: 717) 

47120: S'-CGTACGTAAAGCGGCCGCGCACTGCTGTTTGATGAGG-S' (SEQ ID NO: 718) 

The second reaction, (B), used the previously mentioned four oligonucleotides, as well as 

47108: 5'-CGTACGTAAAGCGGCCGCGAGTGTGGAAGAAAGGCTGC-3' (SEQ ID NO: 719) 
25 47 1 10: S-CGTACGTAAAGCGGCCGCAGTAGAGCTTCCCCTCCTGC-S' (SEQ ID NO: 720) 

471 12: 5'-CGTACGTAAAGCGGCCGCGTCCATCACGAAGTCCAGGT-3' (SEQ ID NO: 721) 
All oligonucleotides contained a Not! linker sequence and were used at a concentration of 

0.02 ug/ul. 4)The Sa/I-adapted cDNA from both reverse transcription reactions was size- 
fractionated by electrophoresis on 1% agarose, 0.1 ug/ml ethidium bromide, lx TAE gels. 
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The ethidium bromide-stained cDNA between 0.6 and 8.0 kb was excised from the gel. The 
cDNA was purified from the agarose gel by electroelution (ISCO Little Blue Tank 
Electroelutor®) using the manufacturer's protocol. The purified cDNA from reactions A and 
B were then pooled together. 5) The size-fractionated Sa/I-adapted cDNA was ligated to 
5 Notl-SaH digested pBluescript® (Stratagene, La Jolla, CA). 

Ligated library cDNA (3 ml) was used to transform electrocompetent E. coli cells 
(ElectroMAX® DH10B cells and protocol, Life Technologies catalog no. 18290-015, BioRad 
E. coli pulser, voltage 1.8 KV, 3-5 msec pulse). The transformed cells were plated on LB- 
ampicillin (100 ng/ml) agar plates and incubated overnight at 37°C. Approximately 10 6 
10 colony forming units (cfu) were plated at a density of 50,000 cfu/150 mm plate. Cells were 
washed off the plates with LB media, and collected by centrifugation. Plasmid DNA was 
purified from the cells using the QIAGEN Plasmid Giga Kit and protocol (catalog no. 12191) 
at a final concentration of 2.05 p.g/jnl. 

Two probes for use in library screening were generated by the polymerase chain 
15 reaction (PCR) using 100 ng of library L401 as template. Standard PCR techniques were * 
used. A reaction mixture contained 10 pmol of each oligo primer; 0.2 mM each dATP, 
dTTP, dCTP and dGTP (PE Applied Biosystems catalog no. N808-0260); 1.5 units Expand™ 
High Fidelity Taq DNA polymerase and lx PCR reaction buffer (Roche Molecular 
Biochemicals, catalog no. 732-641; 10 mM Tris-HCl, 1.5 mM MgCl 2 , 50 mM KC1, pH 8.3). 
20 The mixture was incubated at 99°C for one minute, followed by 30 cycles of 96° C for 15 
seconds, 50° C for 30 seconds, 72° C for 1 minute with a final incubation at 72° C for 7 
minutes (MJResearch DNA Engine® Tetrat PTC-225). The first was generated using oligos 

107335: 5 , -CAGCGGCCTGGAGGATGC-3 f (SEQ ID NO: 722) 

107338: 5-CGGTCCAGTAGAGGTTTCG-3' (SEQ ID NO: 723) 

25 which amplify a Notl-SaK fragment of the Zmaxl gene. The second was generated using 
oligos 

107341 : 5^CATCAGCCGCGCCTTCATG-3' (SEQ ID NO: 724) 

107342: 5'-CCTGCATGTTGGTGAAGTAC-3 ' (SEQ ID NO: 725) 

which amplify a SacVKpnl fragment of the Zmaxl gene. Both PCR products were purified 
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using the Qiaquick® kit and the manufacturer's protocol (Qiagen catalog 28106) then 
subcloned into the vector pCRII-TOPO® (Invitrogen catalog no. K4600) following the 
manufacturer's protocol. Positive subclones were identified by restriction digestion of 
purified plasmid DNA (using standard molecular biology techniques) and subsequent DNA 
5 sequence analysis (ABI Prism BigDye Terminator Cycle® sequencing, catalog no. 4303 154, 
ABI 377 instruments). Probe DNA was isolated by EcdKL restriction digestion (New 
England Biolabs, catalog no. R0101L) of the respective sequence-verified pCRII-TOPO® 
clone. Restriction fragments were size-fractionated by gel electrophoresis on 1% agarose, 0.1 
ug/ml ethidium bromide, lx TAE gels. Insert DNA was excised from the gel and purified 

10 using the Clontech NucleoSpin® Nucleic Acid Purification Kit (catalog no. K305 1-2) 

following the manufacturer's protocol. The purified DNA fragment (25-50 ng) was labeled 
with Redivue ® (a 32 P)-dCTP (Amersham Pharmacia, catalog no. AA0005) using the Prime-It 
II® Random Primer labeling Kit and protocol (Stratagene, catalog no. 300385). 
Unincorporated dCTP was removed with Amersham's NICK® column and protocol (catalog 

15 no. 17-0855-02). 

Two rounds of screening library L401 were initiated to isolate fragments of the Zmaxl 
(LRP5) gene. In the first, forty-three 150 cm LB -100 jxg/ml ampicillin agar plates were 
plated with primary transformants from L401 at a density of about 3,000-4,000 colonies per 
plate. This library was screened using the 32 P-labeled probe (Notl-Sall fragment) as described 

20 above at 500,000-1 ,000,000 cpm/ml hybridization buffer, using standard molecular biology 
protocols. From this primary screen, 13 single colonies were identified based on positive 
hybridization to the Zmaxl probe. Plasmid DNA, prepared using the QIAprep Spin Miniprep 
Kit and protocol (Qiagen Inc., catalog no. 27106), was analyzed by restriction digestion and 
sequence analysis as described above. cDNA clone # 44 was isolated from this screen and 

25 sequence verified to contain a partial Zmaxl clone. 

In the second library screen, one hundred-and-four 150 cm LB ampicillin 100 ug/ml 
agar plates were plated with primary transformants from L401 at a density of 3000-4000 
colonies per plate. This library was screened using the 32 P-labeled probe (SacI-KpnT) exactly 
as previously described. From this primary screen, 48 colonies were identified based on 
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positive hybridization to the Zmaxl (LRP5) probe. Since these colonies were not single 
colony isolates, a secondary screen was initiated where each of the 48 primary isolates was 
plated at a density of approximately 500 colonies per plate. These colonies were then 
screened exactly as the primaries using the labeled Sacl-Kpnl fragment as probe. Thirty-four 
5 of the 48 primary clones resulted in positive hybridization to the Zmaxl probe and were 
isolated as single colonies. Plasmid DNA was prepared and analyzed as described above. 
cDNA isolate #71 J2 was isolated from this screen and sequence verified to contain a partial 
Zmaxl clone. 

In all cases, the sequence of any Zmaxl (LRP5) isolate was compared to a reference 
10 sequence (i.e., the sequence of the wild-type Zmaxl allele from an affected member of the 
HBM kindred). This analysis was important since DNA polymorphisms had been reported 
for this gene in the literature. This reference sequence is predicted to encode a polypeptide of 
Genbank Accession No. AF077820. 

The four independent partial clones used to prepare ZmaxIGI_3AS are as follows: 
15 1). Bases 1-1366: AXbal-SaR fragment was obtained from a Zmaxl cDNA 

construct, GTC.Zmaxl_13. GTC.Zmaxl_13 contains a 5075 BP insert containing the 
entire ORF of LRP5. The clone was blunt end cloned in the EcdRV site of pSTBlue- 
1. This clone was generated by fusing a 5' clone derived from screening a bone 
random primed cDNA library in a pBluescript™ II derivative with a 3' clone derived 
20 from a PCR product from a bone dT primed cDNA library in pBluescript™ II. PCR 

was performed using LRP5 specific forward primer 5 f -GCCCGAAACCTCTACTGG 
ACCGAC-3 ? (SEQ ID NO: 726) and reverse primer 5 f -GCCCACCCCATCACAGTTCA 
CATT-3' (SEQ ID NO: 727) using DNAzyme polymerase. The resultant 3.7 kb PCR 
product was cloned into PCR-XL-TOPO. To generate the full length clone the 5' and 
25 y plasmids were transformed into DM1 (dam-) from Gibco/BRL. The 5' plasmid was 

digested with Xbal and the 3' plasmid was digested with HindSR. The digested 
plasmids were filled in with T4 polymerase to generate blunt ends and cut with Bell. 
the 1.7 kb 5' fragment and 3.5 kb 3* fragments were gel purified, ligated together, and 
cloned in the EcoRV site of pSTBlue-1 . It provides a short 5 'UTR, with coding 
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sequence beginning at base 100. Furthermore, it carries some additional restriction 
sites at the 5' multiple cloning site. This fragment also contains a DNA 
polymorphism relative to the Genbank Accession No. AF077820 sequence at position 
558 resulting in an A (AF077820) to a G change; this mutation does not result in an 
5 amino acid difference (Pro). 

2). Bases 1367-2403: This clone was obtained from a Zmax7-gene primed cDNA 
library made from commercial human liver RNA described above. This fragment is a 
SaR-BamHl piece of DNA obtained from isolate #44. The sequence is identical to 
Genbank Accession No. AF077820. 
10 3). Bases 2404-4013: This BamHl-BssWI fragment was obtained from isolate 

#71-2 from the same library as described above. At position 3456, there is a DNA 
polymorphism resulting in a G (AF077820) to an A. This nucleotide difference does 
not change the encoded amino acid (Val). 

4). Bases 4014-5278: This BssHR-Notl fragment came from an internal clone, 
15 L236BJP0049E08. It was obtained from an oligo-dT primed HeLa cell cDNA 

library. The stop codon occurs at base 4947. The clone contains 331 bp of 3' UTR 
sequence, including a 120 bp poly-A tail followed by the Notl site. The 3 ' Notl site 
used in this subcloning step is a result of an added linker that was introduced at the 
end of the poly-A tail during library construction. A DNA polymorphism is present at 
20 base 4515 resulting in a G (AF077820) to C change that is silent at the amino acid 

level (Leu). 

To generate the 5' section of the Zmaxl (LRP5) gene, the Xbal-SalL fragment and 
SaR-BamJIl fragment were ligated into the Xbal-BaniHI sites of pBluescript® (Stratagene). 
The 3 ! section of the gene was obtained by ligating the 1.61 kb BaniHl-BssHIl fragment from 
25 Zmaxl isolate #71_2 to the 1.26 kb BssHII-NotI fragment from L236B JP0049E08. These 

two fragments were ligated into the BamHL-NotI sites of pBluescript®. The full length Zmaxl 
cDNA was engineered into the Xbal-Noil sites of the vector pCMVSPORT6.0 (Life 
Technologies) by ligation of the Xbal-BaniHI 5' section and the BamHl-Notl 3' section. The 
resulting plasmid, ZmaxGI_3 AS, contains an insert of 5278 bp from the Xbal site to the Not! 
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site of the vector's multiple cloning site. This clone is in the antisense orientation with 
respect to the CMV promoter present in the vector. Zmaxl coding sequence begins at base 
100 and ends at base 4947, followed by 331 bp of 3-primed UTR sequence including a 120 
bp poly-A tail. This foil length cDNA contains three DNA polymorphisms from the 
5 reference sequence (GenBank Accession No. AF077820) that do not alter the predicted 

amino acid sequence. These polymorphisms are at position 558 resulting in an A to G that 
maintains the proline residue; at position 3456 resulting in a G to A that maintains the valine 
residue; and, at position 4515 resulting in a G to C that maintains the leucine residue. 

The sequence of ZmaxlGI_3AS (Figure 25) also contains a DNA polymorphism 

10 relative to SEQ ID NO. 1 at base 4088 resulting in a C (SEQ ID NO: 1 and Genbank 

Accession No: AB017498) to T change that results in an amino acid change at position 1330 
of alanine to valine. This is consistent with the sequence determined in the wild-type allele 
from an affected member of the HBM kindred as well as with the published sequence of 
Genbank Accession No. AF077820. ZmaxlGI_3AS also has 29 additional bases at the 5' end 

15 relative to SEQ ID NO: 1, as well as 129 bases at the 3 f end consisting of an extra G, 120 : 
bases of poly-A tract, and the Not! site. 

21.2.2 Creation of the HBM mutation Gl 71 V 

The HBM mutation that results in a predicted amino acid change from glycine to 
20 valine at amino acid 171 was introduced into the foil length human Zmaxl (LRP5) cDNA 

(plasmid ZmaxlGI_3AS) using PCR to change the G at position 611 to a T. Introduction of 
the HBM mutation was done using oligos 107335: (5-CAGCGGCCTGGA GGATGC-3'; 
SEQ ID NO: 728) and 49513: (5'-CGGGTACATGTACTGGACAGC TGATT AGC-3 1 ; SEQ 
ID NO: 729), which flank the endogenous Notl site of the Zmaxl (LRP5) gene. This method 
25 creates a new PvuII site at the 3 1 end of the PCR product. A second PCR reaction was 

completed using oligos which introduce a Seal site at the 5' end of the product and contains 
the endogenous Sail site of Zmaxl in the 3-primed end. PCR products were purified using 
the QiaQuick® procedure (Qiagen Inc.); subcloned into the vector pCRII-TOPO (Invitrogen) 
as described above. Plasmid DNA was purified from single bacterial colonies and analyzed 
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by restriction digest and subsequent sequence analysis, all as described above. The sequence- 
verified pCRII-TOPO clones were restriction digested with Notl-PvuU and Seal-Sail, 
respectively. The resulting DNA fragments were size fractionated and purified as described 
above. These two fragments were then subcloned into the vector, pBluescript® that had been 
5 prepared by Notl-Sall digestion. Both PvuU and Seal produce blunt ends when used to digest 
double stranded nucleic acids. Thus, the resulting ligated fragment fails to recreate either the 
PvuTL or Seal site and contains only the consensus Zmaxl sequence, with the exception of the 
newly introduced HBM mutation. To introduce the mutation into the full length Zmaxl gene, 
this resulting plasmid was digested with Mscl and Sail, while the 5' region of Zmaxl was 

10 obtained by Xbal-Mscl digestion of Zmaxl plasmid GTC.Zmaxll 3. These two fragments 
were ligated together into Xbal-Sall digested pBluescript®, in effect creating a similar 1.366 
kb Xbal-Sall fragment. The only difference being that this construct contains the HBM 
mutation described above. The full length HBM cDNA then was assembled into 
pCMVSPORT6.0 exactly as described above for the Zmaxl gene, with the substitution of this 

1 5 newly created Xbal-Sall fragment containing the HBM mutation. The entire cDNA insert 
was verified by DNA sequence analysis and the introduction of the HBM mutation was 
confirmed. 

The resulting plasmid, HBMGIJ2AS (Fig. 24), contains an insert of 5,278 bp from the 
Xbal site to the Notl site of the vector's multiple cloning site. This clone is in the antisense 

20 orientation with respect to the CMV promoter in the vector. HBM coding sequence begins at 
base 100 and ends at base 4947, followed by 33 1 bp of 3-primed UTR sequence which 
includes a 120 bp poly- A tail. This full length cDNA contains three DNA polymorphisms 
from the reference sequence, which do not alter the amino acid sequence. These 
polymorphisms occur at position 558, resulting in an A to G change that maintains the 

25 proline residue; at position 3456 resulting in a G to A change that maintains the valine 

residue; and, at position 4515 resulting in a G to C change that maintains the leucine residue. 
Additionally, the HBM mutation is present at position 61 1 (G in Zmaxl to T in HBM) which 
results in a predicted amino acid change of glycine to valine at amino acid position 171, as 
found in affected members of the HBM kindred. This insert sequence was used to generate 
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the construct used for HBM over-expressing transgenic mice. 
21.3 Transgene preparation 

The examples provided herein are illustrations of how transgenic animals can be 
5 prepared. Additional transgenic animals can be prepared as would be known in the art. See, 
for example, Glenn Monastersley et al.., ed, Strategies i n Transgenic Animal Studies (Amer. 
Soc. Microbiology 1995) and the references cited therein. 

21.3.1 CMVhActin promoter-HBM cDNA (HBMMCRA) 

10 To prepare the CMVZ>actin-HBM construct, pCX-EGFP, a plasmid containing the 

chimeric CMV6actin promoter, was purified as a 4778 bp EcoRI fragment. Subsequently, 
the HBM cDNA was excised from HBMGIJ2AS as a 4994 bp XbaVDral fragment, treated 
with Klenow fragment of DNA polymerase, ligated to EcoKL linkers, and digested with 
EcoRI. This fragment was then inserted into the EcoRI site of pCX-EGFP. A Spel/HindUI 

15 7265 bp CMV&actin-HBM fragment was purified for microinjection into mouse embryos; 

21.3.2 Type T collagen promoter-HBM cDNA (HBMMTTC) 

The rat type I collagen promoter-HBM construct was created by first replacing the 
pBS(SK-) (Stratagene) polylinker with another polylinker (i.e., comprising Kpn\ - Spel - 

20 HindBl - Bg\H - Ndel - Sail - Smdl - EcoRI - Pstl - BamHL - Xbal - Seal - Ncol - Clal - 
Notl - SacTL - SacI), that is referred to as BS(SK-)A/D. The S V40 splice and poly (A) n Xbal- 
Ncol region (750 bp) from pcDNA I (Invitrogen, Inc.) was directionally cloned into BS(SK- 
)A/D. Next, a 4994 bp EcoKL HBM cDNA fragment (above) was cloned into the EcoRI site. 
A 3640 bp Xbal, type I collagen promoter fragment was subcloned into thecal site of 

25 BS(SK-) (Stratagene). The promoter fragment was then excised from BS(SK-) with SacE, 

blunt-ended with T4 DNA polymerase, digested with Spel, and ligated into the HBM BS(SK- 
) A/D construct, which was digested with Nde I, blunted with T4 DNA polymerase, and 
digested with Spel. A SpeVClal 9435 bp type I collagen-HBM fragment was purified for 
microinjection into mouse embryos. 
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21.3.3 CMVhActin promoter-Zmax! cDNA (Zm ax 1 WTCB 
The CMV&Actin promoter-HBM cDNA construct from above was used to generate 
the final plasmid. The following three fragments were ligated together: 1) a 6.34 kb Xbal- 
Kpnl backbone fragment from HBMMCBA; 2) a 0.64 kb Xbal-Sapl fragment from 
5 HBMMCBA containing the 3' end of the bActin promoter and the 5' end of the HBM cDNA; 
and 3) a 2.8 kb Sapl-Kpnl fragment derived from the Zmaxl cDNA that contains the wild- 
type sequence. A 7.2 kb Spel - HindUl CMVfeActin - Zmaxl fragment was purified for 
micro-injection into mouse embryos. 

10 21.3.4 Type T collagen promoter-Zmax 1 cPNA (Zmaxl WTTTC) 

The type I collagen - HBM cDNA construct from above was used to generate the 

final plasmid. The HBMMTIC plasmid was linearized with HindUl and cut with either Sail 

to yield a 8.52 kb HindUl - Sail fragment or Sapl to yield a 2.98 kb Sapl - HindUl fragment. 

A 2.8kb Sapl - Sail fragment from the Zmaxl (LRP5) cDNA containing the wild-type 
15 sequence was then ligated to the above two fragments to yield the final plasmid. A 9.4 kb 

Spel - Clal type I collagen - Zmaxl fragment was purified for micro-injection into mouse 

embryos. 

21.4 Confirmation of transgene expression in vitro 
20 Plasmid constructs for HBMMCBA, HBMMTIC, Zmaxl WTCBA and Zmaxl WTTIC 

were transiently transfected into human osteoblast (HOB) cells to measure mRNA expression 
as a test for functionality. 

21.4.1 Transient Transfections 
25 HOB-02-02 cells are a clonal, post-senescent, cell line derived from the HOB-02-C1 

cells (Bodine et. al, 1996, Bone Miner. Res. 11: 806-819). Like the parental cell line, the 
HOB-02-02 cells express the temperature-sensitive SV40 large T-antigen mutant, tsA2Q9. 
Consequently, these cells proliferate at the permissive temperature of 34°C, but stop dividing 
at non-permissive temperatures of 37 °C or above. Also like the parental cell line, the HOB- 
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02-02 cells are cultured with Growth Medium P-MEM/F-12 containing 10% heat 
inactivated fetal bovine serum, 1% penicillin-streptomycin and 2 mM GlutaMAX-1) at 34°C 
in a 5% C0 2 /95% humidified air incubator (Forma Scientific, Marietta, OH). 

For the transient transfections, the HOB-02-02 cells were seeded with Growth 
5 Medium at 400,000 cells/well into 6-well plates and incubated overnight at 34°C. The cells 
were transfected with 0.3 mg/well of either the CMVpActin-HBM expression plasmid, the 
Type I Collagen-HBM expression plasmid or the corresponding empty vectors using 
LipofectAMINE 2000 transfection reagent according to the manufacturer's instructions (Life 
Technologies, Rockville, Md). After a 24 hr incubation at 34°C, the medium was changed, 
10 and the cells were incubated for an additional 24 hr at 39°C. At the end of this last 

incubation, the cells were rinsed with Hank's buffered salt solution. Total cellular RNA was 
then isolated using TRIzol® according to the manufacturer's instructions (GibcoBRL, Grand 
Island, NY). The RNA was treated with RNase-free DNase in order to remove contaminating 
DNA as previously described (Bodine etaL, 1997,7. Cell. Biochem. 65: 368-387). 

15 

2L4.2 TaqMan® .assay for mRNA expression 

TaqMan® primers and probes were chosen based on human and mouse Zmaxl 
(LRP5) cDNA sequences. The selected sequences were designed to be gene-specific by 
analysis of an alignment of human and mouse Zmaxl (LRP5) sequences as illustrated in Fig. 
20 26. 

TaqMan® quantitative reverse transcriptase-polymerase chain reaction (RT-PCR) 
analysis of RNA isolated from human cells was performed as described by the manufacturer 
(PE Applied Biosystems, Foster City, CA) using the following primers and probe set: 
Human Zmaxl -1/HBM-l: 
25 Forward Primer: 5 -GTC AGCCTGGAGG AGTTCTC A-3 ' (SEQ ID NO: 730) 

Reverse Primer: 5 -TC ACCCTTGGC AAT AC AGATGT-3 ' (SEQ ID NO:73 1) 
Probe: 6-FAM-5'-CCCACCCATGTGCCCGTGACA-3' (SEQ ID 

NO:732) 

Results from the experimental primers/probe set were normalized to human GAPDH 
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levels using the multiplex protocol with the human GAPDH control kit from PE Applied 
Biosystems. Species-specific TaqMan® quantitative RT-PCR analysis of RNA isolated from 
murine cells and tissues was performed as described by the manufacturer (PE Applied 
Biosystems, Foster City, CA) using the following primers and probes sets: 
5 Human Zmax-l/HBM-1: 

Forward Primer: 5-CGTGATTGCCGACGATCTC-3 1 (SEQ ID NO:733) 
Reverse Primer: 5 -TTCCGGCCGCT AGTCTTGT-3 1 (SEQ ID NO:734) 

Probe: e-FAM-S^CGCACCCGTTCGGTCTGACGCAGTAC^ 1 (SEQ 
ID NO:735) 
10 Mouse Zmax-l/HBM-1: 

Forward Primer: 5'-CTTTCCCCACGAGTATGTTGGT-3' (SEQ ID NO:736) 

Reverse Primer: 5- AAGGGACCGTGCTGTGAGC-3 1 (SEQ ID NO:737) 

Probe: 6-FAM-5 -AGCCCCTCATOTGCCTCTCAACTTCATAG-S 1 

(SEQ ID NO:738) 

1 5 Results from the experimental primers/probe sets were normalized to 1 8S ribosomal 

RNA levels using the multiplex protocol with the 18S ribosomal RNA control kit from PE 
Applied Biosystems. A summary of these results is presented in Figures 17-20. 

21.5 Production of transgenic mice 

20 21.5.1 DNA microinjection 

Transgene fragments for micro-injection were first purified on 1% agarose gels 
according to the GELase protocol from Epicentre Technologies. Fragments were then further 
purified on cesium chloride density gradients and extensively dialyzed against 5 mM Tris 
(pH 7.4), and 0.1 mM EDTA. 

25 Linearized DNA was microinjected into mouse embryos according to standard 

procedures. DNA was injected into primarily the male pronucleus of fertilized C57BL/6T 
mouse embryos. Injected embryos (n=20-35) were transferred to the oviduct (unilaterally) of 
day 0.5 post coitum pseudopregnant Swiss Webster embryo recipients. Offspring were tail- 
biopsied and genotyped at age 10-14 days. 
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21.6 Production of gene-targeted transgenic mice 

21.6.1 Gene targeting vectors and probes 

Two gene-targeting vectors were constructed for modification of the Zmaxl (LRP5) 
gene in embryonic stem (ES) cells. The two constructs, illustrated in Figure 16, designated as 
5 Zmaxl-KI/KO A&B were designed to generate two types of mutations, a knock-out (KO) of 
the Zmaxl gene and a Cre recombinase dependent knock-in (KI) of a nucleotide substitution 
in order to create a mouse model (i.e., glycine 170 to valine amino acid substitution in mouse 
Zmaxl, of the HBM kindred. 

Both gene-targeting vectors were constructed using genomic DNA of the mouse 
10 genomic DNA BAC clone 473P5 (Genbank Accession No. AZ095413) containing the first 
five exons of the mouse Zmaxl gene. This clone was isolated by Research Genetics 
(Huntsviile, AL) from their mouse 129SvJ genomic BAC library using a polymerase chain 
reaction (PCR) screen for exon 3. A forward and reverse primers of 

Forward: 5 , -GAGCGGGCAGGGATGGATGG-3 f (SEQ ID NO: 739) 
1 5 Reverse: 5 - AGGTTGGC ACGGTGGATGAAGC-3 ' (SEQ ID NO: 740) 

were used to amplify exon 3 by PCR; the following thermal cycling conditions were 
employed: for thirty cycles, 95°C for 0.5 minute, 55°C for 1 minute and 72°C for 1 minute. 
Identity of this clone with mouse Zmaxl was confirmed by sequencing exon 3 using the BAC 
clone DNA as template. PCR products were cloned using the pGEM-T-easy T/A cloning kit. 

20 

21.6.2 Zmaxl (LRP5) knock-in/knock-out vector 

The organization of the genomic BAC clone 473P5 was characterized by Southern 
blot analysis using subcloned exon 1, exon 2 and exon 3 as probes and by sequencing the 
region spanning exon 1 through exon 5. Two different constructs were prepared for the 
25 Zmaxl KI/KO targeting. These constructs (A and B) differ only in flanking arms of 

homology. Construct (A) contains a 6.5 kb BstEIl-Xbal 5' arm of homology and a 1 kb 
Xbal-EcoRI 3' arm of homology; whereas, construct (B) contains a 1 kb 5' arm of homology 
and a 6.0 kb 3' arm of homology. The constructs were prepared by ligating short and long 
arms of homology to a LoxP flanked cassette containing the neomycin resistance gene (MC1- 
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Neo, Stratagene) and a synthetic transcriptional pause sequence (Promega). 

Both Zmaxl -KI/KO- targeting vectors (A and B) were modified to a G-to-T 
nucleotide substitution, encoding the G170V amino acid substitution, in exon 3. These 
modifications were introduced into by overlapping PCR mutagenesis using the wild type 
sequence of the short arm of homology as template. In addition, the 1 kb short ami of the 
Zmaxl -KI/KO (B) targeting vector was modified to include a 5' terminal Pmel restriction 
recognition site. The 5' overlapping fragment was made using the following forward primer 
and reverse mutagenic primer: 

Forward: 5 '-AAGCTTGTTT A A ACTGGGCATGGTGGCACATGGTTGTAAT-3 ' (SEQ 
ID NO: 741) 

Reverse: 5-GGGCTTCCACCCAGTCAGTCCAGTACATGTACCT-3' (SEQ ID NO: 
742). 

The thermal cycling conditions utilized for thirty cycles are 95°C for 0.5 minute, 55°C for 1 
minute and 72°C for 1 minute. The 3' fragment was made using the following forward and 
reverse primers: 

Forward: 5 '-CTGACTGGGIGGAAGCACCCCGGATCGAGC-3 ' (SEQ ID NO: 

743) 

Reverse (mutagenic): 5'-GAATTCATCGGTACCTGTGCGGCCGCTTCATTG -3 5 
(SEQ ID NO: 744). 

The thermal cycling conditions utilized for thirty cycles are 95°C for 0.5 minute, 55°C for 1 
minute and 72°C for 1 minute. The final overlapping PCR used 1 ml each of the 5' fragment 
and 3' fragment PCR reactions as template and amplification was performed using the 
forward and reverse primers of the 5 5 and 3 9 fragments respectively and the same thermal 
cycling parameters. The final PCR product was cloned using the pGEM-T-easy T/A cloning 
kit. The mutagenized exon 3 was excised from Zmaxl-KJJKO (B) and transferred to Zmaxl- 
KI/KO (A) as a 600 bp Bsm&l-Xbal fragment. 

Probes for screening for and characterization of Zmaxl -KI/KO (A) gene targeted ES 
cell clones are prepared by subcloning restriction fragments of BAC clone 473P5. The 5' 
outside probe is a 400 bp Nde-BstEIL fragment, and the 3 5 outside probe is a 500 bp EcdBl- 
BstXl fragment. 

154 



<WO 02092000A2J_> 



WO 02/092000 PCT/US02/14877 



The outside probes for Zmaxl -KI/KO (B) are prepared by PCR cloning genomic 
fragments flanking and immediately adjacent to the targeting vector region of homology. 
The 5' outside probe used for Zmaxl -KI/KO (B) is a 498 bp fragment generated using the 
forward primer of the sequence (5'-TGAGATGTCCTGTCTGTGGC-3'; SEQ ID NO: 745) 
5 and areverse primer of the sequence (5'-TCCTTCCTTCCCTACAGTTG-3'; SEQ ID NO: 
746). The thermal cycling conditions utilized with these probes for thirty cycles are: 95°C 
for 0.5 minute, 55°C for 1 minute and 72°C for 1 minute. The 3' outside probe is a 600 bp 
fragment generated using the forward primer of the sequence (5 '-CCTAAGGATCTCCTTGT 
GTCTGTGG-3'; SEQ ID NO: 747) and a reverse primer of the sequence (5'-CTGCAGCAG 

10 GTC AGT AGCCTGC-3 * ; SEQ ID NO: 748). The thermal cycling conditions utilized with 
these probes for thirty cycles were: 95°C for 0.5 minute, 55°C for 1 minute and 72°C for 1 
minute. Both probes are specific for the Zmaxl gene in genomic southern analysis. PCR 
products are cloned using the pGEM-T-easy T/A cloning kit. 

A probe for ribonuclease protection analysis of Zmaxl mRNA structure and 

1 5 transcription levels was prepared by PCR cloning a cDNA fragment containing exon 3 

through exon 4. The PCR reaction used a complete cDNA as template, a forward primer of 
the sequence (5 ' -TGAGATGTCCTGTCTGTGGC-3 * ; SEQ ID NO: 749), a reverse primer of 
the sequence (5'-TCCTTCCTTCCCTACAGTTG-3'; SEQ ID NO: 750) and the following 
thermal cycling conditions for thirty cycles; 95°C for 0.5 minute, 55°C for 1 minute and 72°C 

20 for 1 minute. The PCR product is cloned using the pGEM-T-easy T/A cloning kit. 

One skilled in the art could use similar protocols to generate other engineered KI- 
alleles to produce transgenic animals with an HBM-like phenotype. 

21.6.3 Gene targeting in ES cells 
25 For gene targeting, embryonic stem (ES) cells are electroporated with 50 mg of 

linearized targeting vector and selected in 200 mg/ml G418 for 7-10 days beginning the day 
after electroporation. G418 resistant clones are picked, expanded and cryopreserved. 
Resistant clones were screened for homologous recombination by an EcoIH genomic 
Southern restriction fragment length analysis using the 5' outside probe, which detects the 



155 



BNSDOCID: <WO 02092000 A2 I : 



WO 02/092000 PCT/US02/14877 



wild type and targeted alleles of Zmaxl as 4 kb and 5 kb fragments, respectively. Gene 
targeted ES cell clones are thawed, expanded, and characterized by Seal genomic restriction 
fragment length analysis using the 3' outside probe, which detects the wild type and targeted 
alleles of Zmaxl as 9 kb and 8 kb fragments, respectively. Gene targeted clones are also 
characterized by sequence analysis of Zmaxl exon 3 to ensure that the G to T substitution 
was included in homologous recombination. 

21.6.4 Production o f gene targeted mice hy blastocyst injection 
To generate chimeric mice, gene targeted ES cell clones are thawed, expanded and 9- 
14 ES cells injected into the blastocoel of 3.5 post coitum (p.c.) host C57BL/6 blastocysts. 
Injected blastocysts (12-17) are then transferred unilaterally into the uterus of 2.5 p.c. 
pseudopregnant Swiss Webster embryo recipients and allowed to develop to term. Chimeric 
males are back-crossed to 129SvEv females and tested for transmission of the targeted allele 
by PCR geneotyping with primers specific to the neomycin resistance gene. 

21-6.5 In Vitro deletion of the Neomycin resistance c assette via Cre rennmhin^. 

To generate Zmaxl (LRP5) KI mice from the Zmaxl KI/KO mice the Neomycin 
resistance (KO) cassette was deleted by micro injection of a Cre expressing plasmid (2 
mg/ml) into the male pronucleus of Zmaxl KI/KO pre-fusion zygotes. Deletion of the KO 
cassette was confirmed by PCR analysis of the cassette insertion site. 

21.6.6 Genotyping transgenic mios 

Genomic DNA was isolated from mouse tail snips by digestion in 500 ul buffer 
containing 50 mM Tris-HCl, (pH 7.2), 50 wM EDTA, (pH 8.0), 0.5% SDS and 0.8 mg/ml 
proteinase K. Samples are incubated at 55 °C with shaking overnight. A 10 \il aliquot was 
heat-inactivated at 99 °C for 5 minutes and diluted 1:20 in water. For PCR, 1 pi of the diluted 
DNA was amplified under the following conditions: Denature: 96°C for 4.5 min; 45 cycles: 
96°C for 30 sec; 63°C for 1 min; 72°C for 1 min; Extension: 72°C for 5 min; 4°C hold. 

The following primer sets are used for genotyping: 
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HBMMCBA: 

5' primers: 296 hp frag ment (SEQ ID NOS: 751-752 respectively) 
Forward: 5'-GCT TCT GGC GTG TGA CCG GCG-3 ' 
Reverse: 5'-GCC GCACAG CGC CAG CAG CAG C-3' 
5 V primers: 400 hp fragment (SEQ ID NOS: 753-754 respectively) 
Forward: 5'-CAC CCA CGC CCC ACA GCC AGT A-3' 
Reverse: 5'-ATT TGC CCT CCC ATA TGT CCT TCC-3' 
HBMMTIC: 

5' primers: 3S2 bp fragment (SEQ ID NOS: 755-756 respectively) 
10 Forward: 5 '-TTC CTC CCA GCC CTC CTC CAT CAG-3 ' 

Reverse: 5'-GCC GCA CAG CGC CAG CAG CAG C-3' 
3' primers: 524 hp fragment (SEQ ID NOS: 757-758 respectively) 

Forward: 5'-GAA TGG CGC CCC CGA CGA C-3' 

Reverse: 5'-GCT CCC ATT CAT CAG TTC CAT AGG-3' 

15 

21.6.7 Confirmation of genotype by Southern analysis 

Mouse genomic DNA was digested with EcoBI and probed with a 1 .0 kb SaH-BamYH 
restriction fragment from the Zmaxl cDNA. The probe hybridizes to a 5 kb fragment in 
transgene positive animals. 

20 

21.7 Phenotyping 

Both in vivo and ex vivo assays are used to evaluate the phenotype in transgenic mice. 
Two strains of wild-type mice, namely C57BL/6 and 129 SvEv, are studied to provide control 
data for phenotypic evaluation in transgenic and gene-targeted mice. In addition, non- 
25 transgenic littermate animals are used as controls. 

21.7.1 hi vivo analysis 

Spinal bone mineral content (BMC) and bone mineral density (BMD) measurements 
performed at Creighton University (Omaha, Nebraska) were made by DXA using a Norland 
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Instruments densitometer (Norland XR2600 Densitometer, Dual Energy X-ray 
Absorptiometry, DXA). Spinal BMC and BMD at other locations used the machinery 
available. There are estimated to be 800 DXA machines currently operating in the U.S. Most 
larger cities have offices or imaging centers which have DXA capabilities, usually a Lunar or 
5 Hologic machine. Each location that provided spine BMC and BMD data included copies of 
the printouts from their machines to provide verification that the regions of interest for 
measurement of BMD have been chosen appropriately. Complete clinical histories and 
skeletal radiographs were obtained. 

The HBM phenotype (and HBM like phenotype which is also included when 
10 discussing the HBM phenotype) in human and animal subjects, preferably humans, can be 
described using criteria such as: very high spinal BMD; a clinical history devoid of any 
known high bone mass syndrome; and skeletal radiographs showing a normal shape of the 
appendicular skeleton. 

pDXA: Wild-type and transgenic mice are anesthetized, weighed and whole-body X- 
15 ray scans of the skeleton generated using the LUNAR small animal PDOmus device. Scans 
are begun when the mice are weaned (i.e., at 3 weeks of age) and repeated at 2 week 
intervals. Wild-type animals are scanned at 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 27, and 29 
weeks. Scanning of transgenic animals would be performed for periods up to 17 weeks. 
Scans are analysed for BMD (bone mineral density), BMC (bone mineral content), TTM 
20 (total tissue mass), and % fat for various body regions. 

Faxitron radiographs: Following pDXA scanning of anesthetized animals, an 
additional X-ray was taken using a Faxitron device allowing measurement of bone size. 

Calcein labeling: Animals are dosed with 15 mg/kg calcein intraperitoneally on two 
consecutive occasions. The first dose was given 9 days before euthanasia and the second 
25 given 2 days before euthanasia allowing measurement of bone formation rate. 

21.7.2 Ex vivn analy,^ 

RNA isolation: Total RNA was isolated from tibia and other tissues using TRIzol® to 
determine mRNA expression. 
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pQCT: The right femur was cleaned of soft tissue and stored in 70% ethanol for 
determination of total and trabecular density of the distal metaphysis and cortical density of 
the mid-shaft. 

MicroCT: The right femur was used to determine trabecular indices of the distal 
5 metaphysis. 

Histology: The right femur was used to determine bone area and static and dynamic 
parameters of the distal metaphysis. 

Bending strength: The left femur was cleaned of soft tissue and stored at -20 °C prior 
to analysis of 3-point bending strength of the mid-shaft. 
10 Compressive strength of vertebra: The entire spine was removed from T10-L6-7. 

Soft tissue was left on and the spine frozen at -20°C until analysis. Compressive strength was 
measured at the L5 vertebra. 

Serum: Animals are euthanized and serum prepared from blood to measure total 
cholesterol, triglycerides, osteocalcin and other biochemical markers. 
15 Lysis: Examples include immunocytochemistry, such as in situ hybridization of 

osteogenic markers and TUTSTEL staining of cells undergoing apoptosis. 

21.8 Results 

21.8.1 Confirmation of expression from transge nic plasmid constructs 
20 The HBM (HBMMCBA and HBMMTIC) and wild-type (Zmaxl WTCBA and 

Zmaxl WTTIC) plasmid constructs were transiently transfected into HOB-02-02 cells, which 
have a very low endogenous level of Zmaxl (LRP5) expression. Two days after transfection, 
RNA was isolated and TaqMan® quantitative RT-PCR was performed to determine the 
mRNA levels of Zmaxl/HBM in the cells. To control for contaminating plasmid DNA, PCR 
25 was performed with or without the prior RT step, in the absence of the RT step, only very low 
levels of Zmaxl/HBM mRNA were detected. However, with the RT step, a 1000-fold 
increase in HBM and Zmaxl mRNA was observed in cells transfected with CMVfcActin- 
promoter constructs as compared to those transfected with the CMV-b-Galactosidase control. 
The type I collagen promoter constructs showed approximately 10-fold increases in HBM 
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and Zmaxl mRNAs, which is consistent with the weaker nature of this promoter compared to 
the CMVpActin promoter. See Fig. 17. 

21.8.2 S pecies specific TaqMan® reag ents for HRM/Zmax1 expression 
Species specific TaqMan® primer and probe sets for Zmaxl/HBM were developed. 
In a series of experiments using HOB cells and mouse MC-3T3-E1 osteoblastic cells, 
Zmaxl/HBM mRNA was measured in a mouse background, and vise versa. These reagents 
useful for the detection and quantization of species-specific expression. As demonstrated in 
Fig. 18, the primer sets are species specific in the mouse and human cell lines. Further, Fig. 
19 demonstrates the quantitative measurement of human Zmaxl RNA in a background of 
mouse RNA. These TaqMan® sets can be used to determine the levels of human or mouse 
HBM or Zmaxl (or other HBM-like variant) message that are being expressed in the mouse 
transgenic lines. 

The species-specific TaqMan® reagents are novel tools for the characterization of both 
endogenous Zmaxl mRNA levels and human Zmaxl/HBM mRNA levels in the transgenic 
mouse tissues. These tools have several advantages over other conventional methods, such as 
Northern hybridization and standard RT-PCR. Some of these advantages are as follows: (1) 
specificity, since only a small region (<100 bps) is amplified primers and probes are chosen 
to sequence regions predicted to have no or minimal cross-reactivity; (2) speed, since the 
procedure is less labor intensive; (3) accuracy, since it is truly quantitative; and (4) 
sensitivity, since it requires only small amounts of starting material (i.e., RNA) and the 
signal-to-noise ratio is high. These advantages are especially important for analyzing mRNA 
levels in bone, because it is difficult to obtain large amounts of RNA from bone. Thus, the 
primer sets developed for TaqMan® analysis of a HBM and Zmaxl expression are important 
embodiments of the present invention. One skilled in the art will recognize that the primers 
described here are preferred embodiments; modifications such as extension or truncation of a 
primer or base substitution are encompassed by the present invention so long as the resultant 
nucleic acid continues to perform substantially the same function. 
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21.8.3 HBM expression in transgenic mice 

Eight transgenic founder animals were produced for the CMV&Actin (HBMMCBA) 
construct and a breeding program initiated to establish lines. Expression of mRNA 
determined by TaqMan® analysis, shown in Fig. 20, showed variable levels of bone 
5 expression in 4 lines. In tibia, expression levels (relative to endogenous Zmaxl (LRP5) in 

HOB-03-C5 cells) showed the following range: line 18 (xl0-l 1 fold); line 2 (x7-10 fold); line 
13 (x 1-2 fold) and line 28 (xl fold). Expression was also detected in other tissues as 
expected based on the known activity of the promoter. For lines 2 and 13, the highest levels 
of HBM expression were found in the heart. A TaqMan® genotyping assay will screen for 
10 potential homozygous animals. 

Six transgenic founder animals were produced for the type I collagen (HBMMTIC) 
construct, and a breeding program initiated to establish lines. Expression of mRNA was 
found in two lines initially tested. In line 19, expression was 7-8 fold and 19-20 fold greater 
than Zmaxl in HOB-03-C5 cells in tibia and femur, respectively. In line 35, a low level of 
1 5 expression was detected in tibia and femur. 

21.9 In vivo pDXA in HBM transgenic mice? 
21.9.1 HBMMCBA construct 

Initial analysis of limited numbers of mice, illustrated in Fig. 21 (A-C), at the 5 week 
20 and 9 week time-points showed that one line tested to date had greater BMD values compared 
to control. At 5 weeks, HBMMCBA line 2 (n=l 1) BMD in femur, spine and total body was 
21%, 24% and 10% greater respectively, than wild-type control. At 9 weeks (n=3), these 
increases in BMD amounted to 19%, 32% and 12%, respectively. 

25 21.9.2 HBMMTTC construct 

Analysis of limited numbers of mice, illustrated in Fig. 21 (D-F), at the 5 week and 9 
week time points showed that two lines tested to date had greater BMD values as compared 
to control. At 5 weeks, HBMMTIC line 19 (n=5) BMD in femur, spine and total body was 
63%, 70% and 41% greater respectively than the wild-type control. At 9 weeks (n=2), these 
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increases in BMD amounted to 52%, 64% and 37% respectively. 

At 5 weeks, HBMMTIC line 35 (n=l) BMD in femur, spine and total body was 4%, 
47% and 6% greater, respectively than the wild-type control. At 9 weeks (n=3), these 
increases in BMD amounted to 32%, 43% and 18% respectively. 

These results indicate that three lines analyzed to date show evidence for a high bone 
mass phenotype. There appeared to be no obvious correlation between levels of mRNA 
expression and BMD phenotype from the limited numbers of lines and animals studied to 
date. For example, HBMMCBA line 2 and HBMMTIC line 19 have similar levels ofHBM 
mRNA in tibia, but the phenotype is more evident in line 19. Also, HBMMTIC line 35 
shows a very low level of expression when compared to HBMMCBA line 2, but HBMMTIC 
line 35 appears to have a stronger phenotype. These observations point to possible 
differences in cellular expression that may impact the phenotype. 

Overall, the BMD results from the transgenic mice show similarities in magnitude to 
the phenotype observed in the HBM affected kindred (Johnson et aL, 1997 \ Am. J. Hum. 
Genetics, 60:1326-1332). For example, spinal BMD measured in affected individuals is 
approximately 34-63% greater than non-afifected family members. The data for spinal BMD 
from the transgenic animals ranges from -30-70% greater than normal at 9 weeks of age. 

21.9.3 Ex vivo analysis of transgenic mis fi 

In order to further examine increases in bone density that were detected in select 
transgenic lines through monitoring of the animals by non-invasive bone imaging, necropsies 
were performed on animals of these lines at 5 and 9 weeks of age for direct bone 
densitometry and histologic analysis. The left femur was isolated, cleaned and positioned in 
an XCT Research peripheral Quantitative Computed Tomograph (pQCT; Stradtec 
Medizintechnik, Pforzheim, Germany). The distal end of the femur was located and pQCT 
scanning was initiated 2.5 mm proximal from this point for total and trabecular 
measurements. The pQCT scan for cortical measurements was initiated 3.5 mm proximal 
from the first scan (i.e., 6 mm proximal from the distal end). The pQCT scans were 0.5 mm 
thick, had a voxel (i.e., three dimensional pixel) size of 0.07 mm, and consisted of 360 
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projections through the slice. After the scans were completed, the images were displayed on 
the monitor and a region of interest, including the entire femur for each scan, was outlined. 
The soft tissue was automatically removed using an iterative algorithm, and the density of the 
remaining bone (total density) in the first slice was determined. The outer 55% of the bone 
5 was then peeled away in a concentric spiral and the density of the remaining bone (trabecular 
density) of the first slice was reported in mg/cm 3 . In the second slice, the boundary between 
cortical and trabecular bone was determined using an iterative algorithm, and the density of 
the cortical bone was determined. 

Analysis of Line 2 Fl CMV6Actin-HBM 5 week old transgenic animals revealed that 

10 total density, trabecular density and cortical density, were 20%, 37% and 4% higher, 

respectively, in the transgenic male mice versus the non-transgenic males. In 5 week old 
Line 19 type I collagen-HBM transgenic males, an even more dramatic increase in bone 
density over their non-transgenic littermates was evident. Total density, trabecular density i 
and cortical density were 53%, 104 % and 5% higher, respectively. In the Line 19 animals, 

15 the phenotype was found to be maintained at 9 weeks of age with total, trabecular and cortical 
bone density being 44%, 101% and 6% higher than the non-transgenic littermates. At 17 
weeks, total and trabecular density were increased 46%, 202%, respectively. The effects on 
the total trabecullar parameters in line 19 at all three time points were statistically 
significantly higher (p<0.001). A somewhat different pattern of bone phenotypic expression 

20 was evident from males of type I collagen-HBM transgenic Line 35. At 5 weeks of age total, 
trabecular and cortical density were only marginally higher (7%, 4% and 4%, respectively). 
However, at 9 weeks of age a clear and statistically significant (P < 0.05, 0.005, and 0.001 
respectively) increase in these parameters became evident (i.e., 25%, 37% and 4%, 
respectively). The occurrence of different patterns of age-related expression of the phenotype 

25 is not unexpected, particularly with the "bone specific" type I collagen transgene, which is 
influenced by stage of bone cell differentiation. Both Line 2 and Line 19 animals at 5 weeks 
of age express comparable levels of HBM mRNA in tibia samples, and these levels are 
significantly greater (>7-8-fold) than other lines that show no apparent bone phenotype at this 
age. Line 19, which is driven by the type I collagen promoter, unlike line 2, shows very low 
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expression of the transgene in tissues other than bone. At 5 weeks of age, Line 35 animals 
show low level expression in bone and none in other tissues. Imnaunohistochemistry of 
calvarial bone sections using an HBM/Zmaxl specific antibody reveals much more intense 
staining in bone cells of transgenic animals from Lines 2 and 19 at 5 weeks of age and from 
Line 35 at 9 weeks of age versus their non-transgenic littermates. 

The findings revealed by pQCT analysis were further examined under greater 
resolution using /^CT instrumentation (Scanco). The femur was positioned such that the 
region being imaged includes the distal end of the femur extending approximately 4 mm 
proximally with the view being perpendicular to the axis of the articulating cartilage. The 
reference line for beginning the juCT measurement was placed to minimally overlap the 
growth plate and extends proximally for 200 scan slices (9 mm thickness). After completing 
the //CT measurement, the first slice in which the condyles have fully merged was identified. 
A region of interest was outlined to include a maximum amount of the trabecular space, while 
excluding the cortex. For the first thirty slices, regions of interest were drawn every five 
slices and merged. For the remaining 105 slices, regions of interest were drawn every 10-20 
slices. The more regular the trabecular space, the less frequently a region of interest needed 
to be drawn. Each region of interest was merged with its predecessor after it was drawn. 
After regions of interest had been established for all 135 slices, three dimensional evaluation 
was performed using a threshold setting of 350. 

The increased bone densities identified by pQCT were confirmed and extended by 
>wCT to include elements of bone architecture. In the Line 2 transgenic animals, juCT bone 
volume/total volume, connectivity density and trabecular thickness were 50%, 83% and 12% 
higher, respectively. Both the connectivity density and trabecular thickness indices suggest 
that the increased density is also associated with increased structural strength. Bone surface/ 
bone volume was lower by 17% in the transgenic males, which may suggest that there may be 
fewer resorptive surfaces and pits. The trabecular bone response was further confirmed by 
histological evaluation of non-decalcified, Goldner's stained sections, which revealed 36% 
greater bone mineral area in the distal femoral metaphysis of the transgenic males. Dynamic 
histomorphometric analysis revealed that a substantial increase in bone mineral apposition 

164 



02092000A2_I_> 



WO 02/092000 PCT/US02/14877 



rate (+100%), as determined by calcein double labeling, may be partially responsible for the 
increased bone in the transgenics. The dramatic effects evident by pQCT on trabecular bone 
in Line 19 were supported by /uCT evaluation where bone volume/total volume, trabecular 
number, trabecular thickness and connectivity density were found to be 130%, 45%, 30% and 
5 121% higher, respectively, in the transgenic males. As seen in Line 2, the bone surface/ bone 
volume was lower in the transgenic males (-36%). All of these effects were statistically 
significant with p < 0.01. The bone phenotype seen at 5 weeks of age in Line 19 was 
maintained in 9 week-old animals where bone volume/total volume, trabecular number and 
connectivity density were 125%, 38% and 1 10% higher than in the non-transgenic 
10 littermates. 

juCT analysis of the Line 35 transgenics revealed a somewhat different pattern than 
the other two lines. In contrast to only modestly increased density indicated by pQCT in 5 
week-old females from Line 35, a statistically significant effect (p < 0.01) was seen with 
mCT, which has greater image resolution and encompasses a larger volumetric sample. Bone 

15 volume/total volume, trabecular thickness and connectivity density were 35%, 9% and 27% : 
higher. A similar result was seen in 5 week-old males from Line 35 where bone volume/total 
volume and connectivity density increases of 37% and 45%, respectively, were evident by 
juCT analysis, where only slight increases were revealed by pQCT. The differences between 
the Line 35 transgenic males and their non-transgenic littermates appeared to increase with 

20 age such that statistically significant increases in total density (25%) and trabecular density 
(37%) were evident by pQCT at 9 weeks of age. The jj,CT results support an age-related 
divergence in bone phenotype in this line and show that differences between transgenic and 
non-transgenic animals, in terms of bone volume/total volume and connectivity density, had 
nearly doubled those seen at 5 weeks to 70% and 83%, respectively. The bone volume 

25 increases seen in the transgenic animals is in agreement with a significant increase in this 

parameter that was detected in a bone biopsy sample from an adult male affected member of 
the HBM kindred. The other parameters that were found to be affected in the transgenic lines 
may reflect changes that lead to an increased peak/adult bone mass, which in this strain of 
mice occurs between the ages of 17-20 weeks 
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The bone density and bone architectural changes seen in the over-expressing 
transgenic lines would suggest potentially greater bio-mechanical strength. This was tested 
directly by evaluating 3-point bending strength of femurs from 5 week old Line 19 males. 
The femora were cleaned of soft tissue and the femoral length measured using a digital 
5 caliper. Periosteal and endosteal circumferences, as well as cortical thickness, were measured 
6 mm from the distal end of the bone using pQCT. The femur was then placed on a fixture so 
that the center of mid-shaft was at an equal distance from fixed supports located 5 mm apart. 
The cross bar of an Instron 5543 load device was placed over the mid-shaft and a force 
applied at a speed of 1 mm/minute until fracture occurred. A force vs. displacement curve 
1 0 was generated and peak load determined using Instron Merlin software. There was a 75% 

increase (p<0.01) in strength that appears to be due to an increase in periosteal circumference 
leading to an increase in cortical thickness. Thus, it appears that the changes in bone density 
and bone geometry, as seen in the HBM transgenic animals, do translate into increases in 
biomechanical strength. 

15 In view of the association of HBM/Zmaxl within the class of LDL related receptor 

proteins, it was of interest to determine whether the mutation might affect lipid profiles. 
Indeed, lipid studies in the HBM kindred (i.e., 8 affected and 7 unaffected members) have 
revealed that triglyceride and VLDL levels are statistically lower in the affected members. 
Serum samples from the transgenic lines were analyzed on a Hitachi 91 1 instrument using 

20 Boehringer Mannheim (for Cholesterol) and Roche (for triglycerides) reagents. The 

cholesterol was measured via o-quinone imine dye (which is formed following enzymatic 
reactions with cholesterol) photometrically at 505 nm at 37 °C. Enzymatic methods for 
triglyceride measurements are based on determination of the glycerol part of triglyceride after 
hydrolysis of triglycerides and fatty acids. The end dye product of enzymatic reaction was 

25 measured at 505 nm. In 5 week old male Line 2 transgenics, although serum cholesterol was 
only slightly reduced, serum triglyceride levels were reduced by 26% in the transgenics 
versus their non-transgenic littermates. In a limited sample of Line 2 animals at 9 weeks of 
age, triglyceride levels remained 20% lower. Similarly, at 5 weeks of age triglyceride levels 
in male transgenics from Line 19 were 32% lower. In contrast, at 5 weeks of age both male 
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and female transgenics of Line 35 did not have lower triglyceride levels. The fact that the 5 
week old Line 35 animals did have statistically greater bone volume/total volume suggests 
that the lipid change may not be directly related to the skeletal phenotype. This would appear 
to be supported by the fact that the Line 35 animals at 9 weeks of age had only slightly 
5 reduced triglyceride levels (1 1%) but exhibited substantially higher bone density than at 5 
weeks of age. Due to the different levels and sites of expression of the transgene in these 
lines we can not rule out the possibility that serum lipid levels could serve as a surrogate 
marker for agents favorably affecting a bone phenotype through HBM/Zmaxl. 

These and other transgenic lines based on HBM or HBM-like genes will serve as 

10 valuable models for exploring the nature of bone homeostasis. Bone density in all species 
accommodates to its customary loading conditions. In the HBM kindred and in the 
transgenic lines, the sensor/effector systems of the skeleton appear to perceive greater load 
signals resulting in greater bone density. Experimental models have been established 
showing that increased bone loading can lead to increased bone density and that unloading or 

15 disuse leads to a loss of bone density. Evaluating the histological, biochemical and genetic:/ 
responses of the skeleton of the transgenic animals in these experimental paradigms will yield 
much information regarding the sensor/effector system responsible for bone homeostasis. 
The application of the transgenic animals in other established models of altered bone 
turnover, including but not limited to steroid deficiency-induced osteopenia and aging-related 

20 osteopenia will provide further insight into the role of Zmaxl in bone homeostasis and the 
nature of the favorable changes induced by the HBM mutation. 

21.9.4 HBM gene-targeting 

The Zmaxl KI/KO gene-targeting vector is electroporated into 129 SvEv, C57BL/6 
25 ES and 129 ES cells. Restriction fragment length analysis of genomic DNA and sequencing 
of PGR amplified fragments can be used to identify gene targeted clones. The knock-in 
version of the gene-targeting vector allows for the introduction of the HBM mutation into the 
endogenous Zmax (LRP5) genomic locus with minimal impact on the mouse genome. It 
permits the production of the HBM protein in a more natural environment, i.e. not in an over- 
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expression model such as the transgenic mice or transfected cell lines. The knock-out version 
of the gene-targeting vector was engineered to contain a transcriptional stop sequence that has 
the potential to result in loss of one functional LRP5 allele. Breeding heterozygous animals 
with this mutation would lead to the production of embryos homozygous for the null allele. 
5 In a different design of the gene-targeting vector, lox P sites could be positioned in such a 
way that would facilitate production of a conditional knock-out of the endogenous LRP5 
gene. In the presence of Cre recombinase, a critical region of the LRP5 gene would be 
deleted in between the lox P sites, thus resulting in the potential loss of one functional allele. 
Animal breeding would then be used to create homozygotes with a null allele. Other 

10 recombinase enzyme systems, such as flp recombinase in combination with cognate fit sites, 
could be used to create the deletion. The recombinase could be administered in a number of 
ways as described earlier, including plasmid injection into embryos and use of transgenic 
animals expressing Cre. The promoter used to drive expression of Cre could be chosen in a 
maimer that would result in ubiquitous or tissue-specific deletion of the LRP5 gene thus 

15 resulting in a conditional knockout. In a further embodiment expression of the Cre enzyme 
itself could be made conditional using inducible systems such as GeneSwitch and 
Tetracycline paradigms. 

21.9.5 Uses of transgenic animals and cells 

20 The transgenic animals and cells of the present invention are useful tools in methods 

for identifying surrogate markers for the HBM phenotype. The surrogate markers provided 
by the present invention are also useful tools for the assessment and screening of prospective 
treatments. Individuals carrying the HBM gene have elevated bone mass. The HBM gene 
causes this phenotype by altering the activities, levels, expression patterns, and modification 

25 states of other molecules involved in bone development. Using a variety of established 
techniques, it is possible to identify molecules, preferably proteins or mRNAs, whose 
activities, levels, expression patterns, and modification states are different between systems 
containing the Zmaxl gene and systems containing the HBM gene. Such systems can be, for 
example, cell-free extracts, cells, tissues or living organisms, such as mice or humans. For a 
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mutant form of Zmaxl, a complete deletion of Zmaxl, mutations lacking the extracellular or 
intracellular portion of the protein, or any other mutation in the Zmaxl gene may be used. It 
is also possible to use expression of antisense Zmaxl RNA or oligonucleotides to inhibit 
production of the Zmaxl protein. For a mutant form of HBM, a complete deletion of HBM, 
5 mutations lacking the extracellular or intracellular portion of the HBM protein, or any other 
mutation in the HBM gene may be used. It is also possible to use expression of antisense 
HBM RNA or oligonucleotides to inhibit production of the HBM protein. 

Molecules identified by comparison of Zmaxl systems and HBM systems can be used 
as surrogate markers in pharmaceutical development or in diagnosis of human or animal bone 

10 disease. Alternatively, such molecules may be used in treatment of bone disease. See, 
Schena et al, 1995 Science, 270: 467-70. 

For example, a transgenic mouse carrying the HBM gene or HBM-like variant in the 
mouse homologue locus is constructed. A mouse of the genotype HBM/+ is viable, healthy 
and has elevated bone mass. To identify surrogate markers for elevated bone mass, HBM/4- 

1 5 (i.e., heterozygous) and isogenic +/+ (i.e., wild-type) mice are sacrificed. Bone tissue mRNA 
is extracted from each animal, and a "gene chip" corresponding to mRNAs expressed in the- 
+/+ individual is constructed. mRNA from different tissues is isolated from animals of each 
genotype, reverse-transcribed, fluorescently labeled, and then hybridized to gene fragments 
affixed to a solid support. The ratio of fluorescent intensity between the two populations is 

20 indicative of the relative abundance of the specific mRNAs in the +/+ and HBM/+ animals. 
Alternatively, mRNA may be isolated from wild-type and transgenic animals. cDNA 
prepared from these samples is transcribed in vitro to obtain labeled mRNA for use on 
custom made or commercially available gene array chips such as are manufactured by 
Affymetrix. Sets of genes with altered expression as a function of phenotype may be 

25 identified be a variety of routine computational analyses. Genes encoding mRNA over- and 
under-expressed relative to the wild-type control are candidates for genes coordinately 
regulated by the HBM gene or HBM-like gene. 

One standard procedure for identification of new proteins that are part of the same 
signaling cascade as an already-discovered protein is as follows. Cells are treated with 
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radioactive phosphorous, and the already-discovered protein is manipulated to be more or less 
active. The phosphorylation state of other proteins in the cell is then monitored by 
polyacrylamide gel electrophoresis and autoradiography, or similar techniques. Levels of 
activity of the known protein may be manipulated by many methods, including, for example, 
comparing wild-type mutant proteins using specific inhibitors such as drugs or antibodies, 
simply adding or not adding a known extracellular protein, or using antisense inhibition of 
the expression of the known protein (Tamura et aL, 1998 Science, 280: 1614-7; Meng, 1998 
EMBOJ., 17: 4391-403; Cooper et aL, 1982 Cell 1: 263-73). 

In another example, proteins with different levels of phosphorylation are identified in 
TE85 osteosarcoma cells expressing either a sense or antisense cDNA for Zmaxl. TE85 cells 
normally express high levels of Zmaxl (Dong et aL, 1998 Biochem. & Biophys. Res. Comm., 
251 : 784-90). Cells containing the sense construct express even higher levels of Zmaxl, 
while cells expressing the antisense construct express lower levels. Cells are grown in the 
presence of 32 P, harvested, lysed, and the lysates run on SDS polyacrylamide gels to separate 
proteins, and the gels subjected to autoradiography (Ausubel et aL, Current Protocols in 
Molecular Biology, John Wiley & Sons (1997)). Bands that differ in intensity between the 
sense and antisense cell lines represent phosphoproteins whose phosphorylation state or 
absolute level varies in response to levels of Zmaxl. As an alternative to the 32 P-labeling, 
unlabeled proteins may be separated by SDS-PAGE and subjected to immunoblotting, using 
the commercially available anti-phosphotyrosine antibody as a probe (Thomas et aL, 1995 
Nature, 376: 267-71). As an alternative to the expression of antisense RNA, transfection 
with chemically modified antisense oligonucleotides can be used (Woolf et aL, 1990 Nucleic 
Acids Res., 18: 1763-9). 

Many bone disorders, such as osteoporosis, have a slow onset and a slow response to 
treatment. It is therefore useful to develop surrogate markers for bone development and 
mineralization. Such markers can be useful in developing treatments for bone disorders, and 
for diagnosing patients who may be at risk for later development of bone disorders. 
Examples of preferred markers are N- and C-terminal telopeptide markers described, for 
example, in U.S. Patent Nos. 5,455,179, 5,641,837 and 5,652,1 12, the disclosures of which 
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are incorporated by reference herein in their entirety. In the area of HTV disease, CD4 counts 
and viral load are useful surrogate markers for disease progression (Vlahov et al. 9 1998 
JAMA, 279(1): 35-40). There is a need for analogous surrogate markers in the area of bone 
disease. 

5 A surrogate marker can be any characteristic that is easily tested and relatively 

insensitive to non-specific influences. For example, a surrogate marker can be a molecule 
such as a protein or mRNA in a tissue or in blood serum. Alternatively, a surrogate marker 
may be a diagnostic sign such as sensitivity to pain, a reflex response or the like. 

In yet another example, surrogate markers for elevated bone mass are identified using 

10 a pedigree of humans carrying the HBM gene or HBM-Uke gene. Blood samples are 

withdrawn from three individuals that carry the HBM gene or HBM-Kke gene, and from three 
closely related individuals that do not. Proteins in the serum from these individuals are 
electrophoresed on a two dimensional gel system, in which one dimension separates proteins 
by size, and another dimension separates proteins by isoelectric point (Epstein et aL, 1996 

15 Electrophoresis 17: 1655-70). Spots corresponding to proteins are identified. A few spots 
are expected to be present in different amounts or in slightly different positions for the HBM 
individuals compared to their normal relatives. These spots correspond to proteins that are 
candidate surrogate markers. The identities of the proteins are determined by 
microsequencing, and antibodies to the proteins can be produced by standard methods for use 

20 in diagnostic testing procedures. Diagnostic assays for HBM proteins or other candidate 
surrogate markers include using antibodies described in this invention and a reporter 
molecule to detect HBM in human body fluids, membranes, bones, cells, tissues or extracts 
thereof. The antibodies can be labeled by joining them covalently or noncovalently with a 
substance that provides a detectable signal. In many scientific and patent literature, a variety 

25 of reporter molecules or labels are described including radionuclides, enzymes, fluorescent, 
chemiluminescent or chromogenic agents (U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149; and 4,366,241). The transgenic or genetically modified 
animals can also serve in a method for surrogate marker identification. 

Using these antibodies, the levels of candidate surrogate markers are measured in 
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normal individuals and in patients suffering from a bone disorder, such as osteoporosis, 
osteoporosis pseudoglioma, Engelmann's disease, Ribbing's disease, hyperphosphatasemia, 
Van Buchem's disease, melorheostosis, osteopetrosis, pychodysostosis, sclerosteosis, 
osteopoikilosis, acromegaly, Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis 
imperfecta, hypoparathyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroid- 
ism, primary and secondary hyperparathyroidism and associated syndromes, hypercalciuria, 
medullary carcinoma of the thyroid gland, osteomalacia and other diseases. Techniques for 
measuring levels of protein in serum in a clinical setting using antibodies are well 
established. A protein that is consistently present in higher or lower levels in individuals 
carrying a particular disease or type of disease is a useful surrogate marker. 

A surrogate marker can be used in diagnosis of a bone disorder. For example, 
consider a child that presents to a physician with a high frequency of bone fracture. The 
underlying cause may be child abuse, inappropriate behavior by the child, or a bone disorder. 
To rapidly test for a bone disorder, the levels of the surrogate marker protein are measured 
using the antibody described above. 

Levels of modification states of surrogate markers can be measured as indicators of 
the likely effectiveness of a drug that is being developed. It is especially convenient to use 
surrogate markers in creating treatments for bone disorders, because alterations in bone 
development or mineralization may require a long time to be observed. For example, a set of 
bone mRNAs, termed the "HBM-inducible mRNA set" is found to be over-expressed in 
HBM/+ mice as compared to +/+ mice, as described above. Expression of this set can be 
used as a surrogate marker. Specifically, if treatment of +/+ mice with a compound results in 
over-expression of the HBM-inducible mRNA set, then that compound is considered a 
promising candidate for further development. 

This invention is particularly useful for screening compounds by using the Zmaxl or 
HBM protein or HBM-like proteins or binding fragments thereof in any of a variety of drug 
screening techniques. 

The Zmaxl or HBM protein or fragment employed in such a test may either be free in 
solution, affixed to a solid support, or borne on a cell surface. One method of drug screening 
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utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the protein or fragment, preferably in competitive binding assays. 
Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, for the formation of complexes between a Zmaxl or HBM protein or 
5 fragment and the agent being tested, or examine the degree to which the formation of a 

complex between a Zmaxl or HBM protein or fragment and a known ligand is interfered with 
by the agent being tested. 

Thus, the present invention provides methods of screening for drugs comprising 
contacting such an agent with a Zmaxl (LRP5) or HBM protein or fragment thereof and 

10 assaying (i) for the presence of a complex between the agent and the Zmaxl or HBM protein 
or fragment, or (ii) for the presence of a complex between the Zmaxl or HBM protein or 
fragment and a ligand, by methods well known in the art. In such competitive binding assays 
the Zmaxl or HBM protein or fragment is typically labeled. Free Zmaxl or HBM protein or 
fragment is separated from that present in a proteinrprotein complex, and the amount of free 

15 (i.e., uncomplexed) label is a measure of the binding of the agent being tested to Zmaxl or 
HBM or its interference with Zmaxl or HBM: ligand binding, respectively. t 

Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to the Zmaxl or HBM proteins and is described 
in detail in WO 84/03564. Briefly stated, large numbers of different small peptide test 

20 compounds are synthesized on a solid substrate, such as plastic pins or some other surface. 
The peptide test compounds are reacted with Zmaxl or HBM proteins and washed. Bound 
Zmaxl or HBM protein is then detected by methods well known in the art. Purified Zmaxl 
or HBM can be coated directly onto plates for use in the aforementioned drug screening 
techniques. However, non-neutralizing antibodies to the protein can be used to capture 

25 antibodies to immobilize the Zmaxl or HBM protein on the solid phase. 

This invention also contemplates the use of competitive drug screening assays in 
which neutralizing antibodies capable of specifically binding the Zmaxl or HBM protein 
compete with a test compound for binding to the Zmaxl or HBM protein or fragments 
thereof. In this manner, the antibodies can be used to detect the presence of any peptide that 



173 



BNSDOCID: <WO 02092000A2_I_> 



WO 02/092000 



PCT/US02/14877 



shares one or more antigenic determinants of the Zmaxl or HBM protein. 

A further technique for drug screening involves the use of host eukaryotic cell lines or 
cells (such as described above) that have a nonfunctional Zmaxl or HBM gene. These host 
cell lines or cells are defective at the Zmaxl or HBM protein level. The host cell lines or 
cells are grown in the presence of drug compound. The rate of growth of the host cells is 
measured to determine if the compound is capable of regulating the growth of Zmaxl or 
HBM defective cells. 

The goal of rational drug design is to produce structural analogs of biologically active 
proteins of interest or of small molecules with which they interact (e.g., agonists, antagonists, 
inhibitors) in order to fashion drugs which are, for example, more active or stable forms of 
the protein, or which, e.g., enhance or interfere with the function of a protein in vivo. See, 
e.g., Hodgson, 1991 Bio/Technology, 9: 19-21. In one approach, one first determines the 
three-dimensional structure of a protein of interest (e.g., Zmaxl or HBM protein) or, for 
example, of the Zmaxl - or HBM-receptor or ligand complex, by x-ray crystallography, by 
computer modeling or most typically, by a combination of approaches. Less often, useful 
information regarding the structure of a protein may be gained by modeling based on the 
structure of homologous proteins. An example of rational drug design is the development of 
HIV protease inhibitors (Erickson et aL 9 1990 Science, 249: 527-33). In addition, peptides 
(e.g., Zmaxl or HBM protein) are analyzed by an alanine scan (Wells, 1991 Methods in 
EnzymoL, 202: 390-41 1). In this technique, an amino acid residue is replaced by Ala, and its 
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide 
is analyzed in this manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a functional assay, 
and then to solve its crystal structure. In principle, this approach yields a pharmacore upon 
which subsequent drug design can be based. It is possible to bypass protein crystallography 
altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically 
active antibody. As a mirror image of a mirror image, the binding site of the anti-ids would 
be expected to be an analog of the original receptor. The anti-id could then be used to 
identify and isolate peptides from banks of chemically or biologically produced banks of 
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peptides. Selected peptides would then act as the pharmacore. 

Thus, one may design drugs which have, e.g., improved Zmaxl or HBM protein 
activity or stability or which act as inhibitors, agonists, antagonists, etc. of Zmaxl or HBM 
protein activity. By virtue of the availability of cloned Zmaxl or HBM sequences, sufficient 
5 amounts of the Zmaxl or HBM protein may be made available to perform such analytical 
studies as X-ray crystallography. In addition, the knowledge of the Zmaxl or HBM protein 
sequence provided herein will guide those employing computer modeling techniques in place 
of, or in addition to X-ray crystallography. 

Identified drug candidates (known as "leads") may be further studied by use of 

10 transgenic animals. The transgenic animals of the present invention are useful for creating an 
animal model of bone density modulation which may be used to test and refine drug leads. 
The transgenic animals described above represent a single example of the Zmaxl and HBM 
or HBM-like transgenic animals contemplated herein. One skilled in the art is aware of 
variations and the considerations which will be routinely applied in modifying the present 

15 invention to a specific purpose. Examples of the development of transgenic animal models 
are given for example in Strategies in Transgenic Animal Science (1995, Monastersky and 
Robl Eds., Washington, DC: American Society for Microbiology) and references therein 
which are all incorporated herein by reference in their entirety. 

As an example, at least two groups of transgenic animals can be created as described, 

20 so that one group expresses HBM and another group expresses Zmaxl. These animals can be 
treated with the candidate drug for some time spanning from a few days to the remainder of 
the animal f s life-span. The animals are monitored for changes in bone mass and/or surrogate 
markers for the HBM phenotype. The transgenic animals used in such a study may express 
human HBM protein and Zmaxl protein or the homologous HBM and Zmaxl proteins 

25 defined for each species or variants thereof. Expression may be driven by a ubiquitous 

promoter or a bone specific promoter as would be known. It will be informative to compare 
groups of animals utilizing different promoters. 

The transgenic animals of the method according to the present inventions may also 
comprise knock-in (KI) and/or knock-out (KO) animals, such as mice, which express HBM, 
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Zmaxl, or neither under the control of the animal's native promoter. Such animals may be 
created by homologous recombination in ES cells as described above (and elsewhere in the 
literature of the art such as, for example, U.S. Patent Nos. 6,187,991 and 6,187,992 and 
references cited therein which are incorporated herein in their entirety). The experimental 
groups of transgenic animals treated with candidate drugs may be monitored by non-invasive 
means, by the monitoring of surrogate markers as described above, and/or by ex vivo analysis 
of bones from sacrificed animals at given time-points. 

Likewise the effect of such treatments as dietary control (e.g. varying intake of 
vitamins, minerals, proteins, lipids, etc.), ovariectomy, direct administration of all or part of 
purified HBM or Zmaxl proteins, administration of antisense nucleotides, antibodies against 
Zmaxl or gene therapy in adults may be investigated by systematic administration of the 
treatment to transgenic animals according to the invention. Such treatments may include, 
administration of estrogens, tamoxifen, raloxifene, (or other selective estrogen modulators, 
SERMs), vitamin D analogs, calcitonin, cathepsin K inhibitors, statins (e.g. simvastatin, 
pravastatin, and lovastatin), bis-phosphonates, parathyroid hormone (PTH), bone 
morphogenetic proteins (BMP) as described in U.S. Patent Nos. 6,190,880 and 5,866,364, 
and combinations of the above compounds. 

In view of the homology between Zmaxl and LRP6 and to the LDL receptor and the 
further observations of markers for cardiac health being modulated in HBM subjects, it is an 
aspect of the present invention to use the novel research methods disclosed herein to screen 
known cardio-protective treatments for bone modulating effects. Thereby, the present 
invention provides therapeutic methods which are both cardio-protective and which improve 
bone quality. The models are useful for testing drugs and researching lipid modulation 
effects related to Zmaxl, LRP6, HBM and HBM like proteins and nucleic acids. 

The effect of various mutations of Zmaxl, HBM and HBM-like genes may be 
investigated by creation of additional lines of transgenic animals according to the invention, 
wherein these animals comprise such mutations. By comparison of direct measures of bone 
development or surrogate markers, an embodiment of the invention provides a useful research 
tool for screening gene therapy reagents, candidate drug therapies, and elucidating molecular 
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mechanisms of bone development modulation. One skilled in the art knows how to use the 
methods of the present invention to achieve these goals. 

The present invention provides a method and useful research tools for testing 
prospective gene therapies. Transgenic knock-out mice are useful for testing prospective 
5 gene therapies. A transgenic knock-out animal such as a mouse is created as described 
above which does not express endogenous Zmaxl or HBM. A prospective gene therapy, 
such as intravenous injection of a recombinant replication-defective adenovirus encoding the 
human HBM protein driven by the CMVpActin promoter, is administered. Parameters of 
bone density and/or surrogate markers are monitored over time following therapy (Ishibashi 

10 et aL, 1993 J. Clin. Invest. 92: 883-93) A TaqMan® primer set such as that described above 
may be used to measure expression of transgenic HBM. One skilled in the art knows 
alternative methods such as the Northern blot method. Comparison of treated and untreated 
animals both within and between groups of germ-line transgenic animals, knock-out (null : 
allele) background, and wild-type endogenous Zmaxl background animals provides 

15 complementary controls for assessing the relative effectiveness of various modalities of gene 
therapy. 

Uses for the transgenic animals models contemplated herein also include, but are not 
limited to: (1) sources for generating bone cell cultures from the calvaria of the transgenic 
animals to study bone cell (e.g., osteoblast and osteoclast) function and number; (2) models 

20 for studying the effects of estrogen loss by ovariectomizing (ovx) the transgenic animals; (3) 
models for testing mechanical loading on the bones and other stress/strength tests; (4) 
breeding models with which to breed to other genetically modified or naturally occurring 
mutant animals that display bone abnormalities; (Chipman et aL, 1993 PNAS, 90: 1701-05; 
Phillips et aL, 2000 Bone 27: 219-26; Kajkenova et aL, 1997 J. BoneMin. Res. 12: 1772-79; 

25 Jilka et aL, 1996 J. Clin. Invest. 97: 1732-40; Takahashi et aL, 1994 Bone and Mineral 24: 
245-55); (5) bone mis/disuse models to test the effects of weight bearing or gravity; (6) 
models for identifying and screening reagents which may or are known to modulate bone 
metabolism (e.g., PTH, estrogen, vitamin D analogs, bisphosphonates, statins, leptin, BMP, 
apoE); (7) models for investigating prospective treatments to improve fracture repair. 
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The transgenic animal models can be analyzed using, but not limited to, such methods 
as bone densitometry by pDEXA, pQCT and microCT; histology, molecular marker analysis, 
apoptosis, cell proliferation, cell cycle, mineralization, serum biochemistry, transcriptional 
profiling, and the like. 

5 

22. Methods of Use: Avian and Mammalian Animal Husbandry 

The Zmaxl (LRP5) DNA and Zmaxl (LRP5) protein and/or the HBM DNA and 
HBM protein (or LRP6 or an HBM-like nucleic acid or protein as also contemplated herein) 
can be used for vertebrate and preferably human therapeutic agents and for avian and 

10 mammalian veterinary agents, including for livestock breeding. Birds, including, for 

example, chickens, roosters, hens, turkeys, ostriches, ducks, pheasants and quails, can benefit 
from the identification of the gene and pathway for high bone mass. In many examples cited 
in literature (for example, McCoy et aL, 1996 Res. Vet Set, 60: 185-6), weakened bones due 
to husbandry conditions cause cage layer fatigue, osteoporosis and high mortality rates. 

1 5 Additional therapeutic agents to treat osteoporosis or other bone disorders in birds can have 
considerable beneficial effects on avian welfare and the economic conditions of the livestock 
industry, including, for example, meat and egg production. 

23. Methods of use: Diagnostic assays using Zmavl - specific oligonucleotides for 
20 detection of genetic alterations affecting hone development 

In cases where an alteration or disease of bone development is suspected to involve an 
alteration of the Zmaxl, LRP6, HBM or HBM-like gene, specific oligonucleotides may be 
constructed and used to assess the level of Zmaxl mRNA or HBM mRNA, respectively, in 
bone tissue or in another tissue that affects bone development. 
25 For example, to test whether a person has the HBM gene, which affects bone density, 

polymerase chain reaction can be used. Two oligonucleotides are synthesized by standard 
methods or are obtained from a commercial supplier of custom-made oligonucleotides. The 
length and base composition are determined by standard criteria using the Oligo 4.0 primer 
u Picking program (Wojchich Rychlik, 1992) or any suitable alternative. One of the 
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oligonucleotides is designed so that it will hybridize only to HBM DNA under the PCR 
conditions used. The other oligonucleotide is designed to hybridize a segment of Zmaxl 
genomic DNA such that amplification of DNA using these oligonucleotide primers produces 
a conveniently identified DNA fragment. For example, the pair of primers 
CCAAGTTCTGAGAAGTCC (SEQ ID NO:32) and AATACCTGAAACCATACCTG (SEQ 
ID NO:33) will amplify a 530 base pair DNA fragment from a DNA sample when the 
following conditions are used: step 1 at 95 °C for 120 seconds; step 2 at 95 °C for 30 seconds; 
step 3 at 58 °C for 30 seconds; step 4 at 72 °C for 120 seconds; where steps 2-4 are repeated 
35 times. Tissue samples may be obtained from hair follicles, whole blood, or the buccal 
cavity. 

The fragment generated by the above procedure is sequenced by standard techniques. 
Individuals heterozygous for the HBM gene will show an equal amount of G and T at the 
second position in the codon for glycine 171. Normal or homozygous wild-type individuals 
will show only G at this position. Similar routine procedures may be used to develop assays 
for other polymorphisms and variants according to the invention. 

Other amplification techniques besides PCR may be used as alternatives, such as 
ligation-mediated PCR or techniques involving Q-beta replicase (Cahill et al. 9 Clin. Chem., 
37(9):1482-5 (1991)). For example, the oligonucleotides 5 *- AGCTGCTC GT AGCTG 
TCTCTCCCTGGATCACGGGTACATGTACTGGACAGACTGGGT-3 f (SEQ ID NO:34) 
and T5-GAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCA 
TTACGGC-3' (SEQ ID NO: 35) can be hybridized to a denatured human DNA sample, 
treated with a DNA ligase, and then subjected to PCR amplification using the primer 
oligonucleotides S'-AGCTGCTCGTAGCTGTCTCTCCCTGGA-S' (SEQ ID NO:36) and 5'- 
GCCGTAATGCGGC AC AGGGAAT AAGCT-3 1 (SEQ ID NO:37). In the first two 
oligonucleotides, the outer 27 bases are random sequence corresponding to primer binding 
sites, and the inner 30 bases correspond to sequences in the Zmaxl gene. The T at the end of 
the first oligonucleotide corresponds to the HBM gene. The first two oligonucleotides are 
ligated only when hybridized to human DNA carrying the HBM gene, which results in the 
formation of an amplifiable 1 14 bp DNA fragment. 
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Products of amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection known to one skilled in the 
art of molecular biology (Sambrook et aL, Molecular Cloning: A Laboratoiy Manual Cold 
Spring Harbor Laboratoiy, Cold Spring, NY (1989)). 

Other alterations in the Zmaxl gene or the HBM gene (or LRP6 or HBM-like nucleic 
acids, which are also contemplated herein) may be diagnosed by the same type of 
amplification-detection procedures, by using oligonucleotides designed to identify those 
alterations. These procedures can be used in animals as well as humans to identify alterations 
in Zmaxl or HBM that affect bone development. 

Expression of Zmaxl or HBM in bone tissue may be accomplished by fusing the 
cDNA of Zmaxl or HBM, respectively, to a bone-specific promoter in the context of a vector 
for genetically engineering vertebrate cells. DNA constructs are introduced into cells by 
packaging the DNA into virus capsids, by the use of cationic liposomes, electroporation, or 
by calcium phosphate transfection. Transfected cells, preferably osteoblasts, may be studied 
in culture or may be introduced into bone tissue in animals by direct injection into bone or by 
intravenous injection of osteoblasts, followed by incorporation into bone tissue (Ko et al 9 
1996 Cancer Res. 56: 4614-9). For example, the osteocalcin promoter, which is specifically 
active in osteoblasts, may be used to direct transcription of the Zmaxl gene or the HBM gene. 
Any of several vectors and transfection methods may be used, such as retroviral vectors, 
adenovirus vectors, or vectors that are maintained after transfection using cationic liposomes, 
or other methods and vectors described herein. 

Alteration of the level of functional Zmaxl protein or HBM protein affects the level 
of bone mineralization. By manipulating levels of functional Zmaxl protein or HBM protein, 
it is possible to affect bone development and to increase or decrease levels of bone 
mineralization. For example, it may be useful to increase bone mineralization in patients 
with osteoporosis. Alternatively, it may be useful to decrease bone mineralization in patients 
with osteopetrosis or Paget's disease. Alteration of Zmaxl levels or HBM levels can also be 
used as a research tool. Specifically, it is possible to identify proteins, mRNA and other 
molecules whose level or modification status is altered in response to changes in functional 
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levels of Zmaxl or HBM. The pathology and pathogenesis of bone disorders is known and 
described, for example, in Rubin and Farber (Eds.), Pathology , 2nd Ed., S.B. Lippincott Co., 
Philadelphia, PA (1994). 

A variety of techniques can be used to alter the levels of functional Zmaxl or HBM. 
5 For example, intravenous or intraosseous injection of the extracellular portion of Zmaxl or 
mutations thereof, or HBM or mutations thereof, will alter the level of Zmaxl activity or 
HBM activity, respectively, in the body of the treated human, animal or bird. Truncated 
versions of the Zmaxl protein or HBM protein can also be injected to alter the levels of 
functional Zmaxl protein or HBM protein, respectively. Certain forms of Zmaxl or HBM 

10 enhance the activity of endogenous protein, while other forms are inhibitory. 

In a preferred embodiment, the HBM protein is used to treat osteoporosis, fracture, or 
other bone disorder. In a further preferred embodiment, the extracellular portion of HBM or 
fragment thereof (e.g., the Dkk binding domain) protein is used. This HBM protein may be 
optionally modified by the addition of a moiety that causes the protein to adhere to the 

15 surface of cells. The protein is prepared in a pharmaceutically acceptable solution and is 

administered by injection or another method that achieves acceptable pharmacokinetics and 
distribution. 

In a second embodiment of this method, Zmaxl, HBM, HBM variant, and/or LRP6 
levels are increased or decreased by gene therapy techniques. To increase Zmaxl or HBM 

20 levels, osteoblasts or another useful cell type are genetically engineered to express high levels 
of Zmaxl or HBM as described above. Alternatively, to decrease Zmaxl or HBM levels, 
antisense constructs that specifically reduce the level of translatable Zmaxl or HBM mRNA 
can be used. In general, a tissue-nonspecific promoter may be used, such as the CMV 
promoter or another commercially available promoter found in expression vectors (Wu et al. y 

25 1996 Toxicol Appl Pharmacol. 141: 330-9). In a preferred embodiment, a Zmaxl cDNA or 
its antisense is transcribed by a bone-specific promoter, such as the osteocalcin or another 
promoter, to achieve specific expression in bone tissue. In this way, if a Zmaxl-expressing 
DNA construct or HBM-expressing construct is introduced into non-bone tissue, it will not be 
expressed. 
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In a third embodiment of this method, antibodies against Zmaxl, LRP6, HBM-like or 
HBM are used to modulate its function. Such antibodies are identified herein. 

In a fourth embodiment of this method, drugs that are agonists or antagonists of 
Zmaxl function or HBM function are used. Such drugs are described herein and optimized 
5 according to techniques of medicinal chemistry well known to one skilled in the art of 
pharmaceutical development. 

Zmaxl and HBM interact with several proteins, such as ApoE. Molecules that inhibit 
the interaction between Zmaxl or HBM and ApoE or another binding partner are expected to 
alter bone development and mineralization. Such inhibitors may be useful as drugs in the 
10 treatment of osteoporosis, osteopetrosis, or other diseases of bone mineralization. Such 
inhibitors may be low molecular weight compounds, proteins or other types of molecules. 
See, Kim et al 9 1998 J. Biochem. (Tokyo) 124: 1072-1076. 

Inhibitors of the interaction between Zmaxl or HBM and interacting proteins may be 
isolated by standard drug-screening techniques. For example, Zmaxl protein, (or a fragment 
1 5 thereof) or HBM protein (or a fragment thereof) can be immobilized on a solid support such 
as the base of microtiter well. A second protein or protein fragment, such as ApoE is 
derivatized to aid in detection, for example with fluorescein. Iodine, or biotin, then added to 
the Zmaxl or HBM in the presence of candidate compounds that may specifically inhibit this 
protein-protein domain of Zmaxl or HBM, respectively, and thus avoid problems associated 
20 with its transmembrane segment. Drug screens of this type are well known to one skilled in 
the art of pharmaceutical development. 

Because Zmaxl and HBM are involved in bone development, proteins that bind to 
Zmaxl and HBM are also expected to be involved in bone development. Such binding 
proteins can be identified by standard methods, such as co-immunoprecipitation, co- 
25 fractionation, or the two-hybrid screen (Ausubel et aL, Current Protocols in Molecular 

Biology, John Wiley & Sons (1997)). For example, to identify Zmaxl -interacting proteins or 
HBM-interacting proteins using the two-hybrid system, the extracellular domain of Zmaxl or 
HBM is fused to LexA and expressed for the yeast vector pEG202 (the "bait") and expressed 
in the yeast strain EGY48. The yeast strain is transformed with a "prey" library in the 
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appropriate vector, which encodes a galactose-inducible transcription-activation sequence 
fused to candidate interacting proteins. The techniques for initially selecting and 
subsequently verifying interacting proteins by this method are well known to one skilled in 
the art of molecular biology (Ausubel et al. y 1997). 
5 In a preferred embodiment, proteins that interact with HBM, but not Zmaxl , are 

identified using a variation of the above procedure (Xu et al. 9 1997 Proc. Natl. Acad. Set 
USA, 94: 12473-8). This variation of the two-hybrid system uses two baits, and Zmaxl and 
HBM are each fused to LexA and TetR, respectively. Alternatively, proteins that interact 
with the HBM but not Zmaxl are also isolated. These procedures are well known to one 
10 skilled in the art of molecular biology, and are a simple variation of standard two-hybrid 
procedures. 

As an alternative method of isolating substances interacting with Zmaxl or HBM, a 
biochemical approach is used. The Zmaxl protein or a fragment thereof, such as the 
extracellular domain, or the HBM protein or a fragment thereof, such as the extracellular 

15 domain, is chemically coupled to Sepharose beads. The Zmaxl- or HBM-coupled beads are ■* 
poured into a column. A biological extract, such as a lipid fraction, serum proteins, proteins 
in the supernatant of a bone biopsy, or cellular contents from gently lysed osteoblast cells, is 
added to the column. Non-specifically bound compounds are eluted, the column is washed 
several times with a low-salt buffer, and then tightly binding compounds may be eluted with 

20 a high-salt buffer. These are candidate compounds that bind to Zmaxl or HBM, and can be 
tested for specific binding by standard tests and control experiments. Sepharose beads used 
for coupling proteins and the methods for performing the coupling are commercially 
available (Sigma), and the procedures described here are well known to one skilled in the art 
of protein biochemistry. 

25 As a variation of the above procedure, proteins that are eluted by high salt from the 

Zmaxl- or HBM-sepharose column are then added to an HBM-Zmaxl -sepharose column. 
Proteins that flow through without sticking are proteins that bind to Zmaxl but not to HBM. 
Alternatively, proteins that bind to the HBM protein and not to the Zmaxl protein can be 
isolated by reversing the order in which the columns are used. 
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Isolated compounds may be identified by standard methods such as 2D gel 
electrophoresis, chromatography, and mass spectroscopy. 

24. Method of Use: Transformation- Associated Recombination (TAR) dinning 

5 Essential for the identification of novel allelic variants of Zmaxl (LRP5) and LRP6 is 

the ability to examine the sequence of both copies of the gene in an individual. To 
accomplish this, two "hooks," or regions of significant similarity, are identified within the 
genomic sequence such that they flank the portion of DNA that is to be cloned. Most 
preferably, the first of these hooks is derived from sequences 5 f to the first exon of interest 

10 and the second is derived from sequences 3' to the last exon of interest. These two "hooks" 
are cloned into a bacterial/yeast shuttle vector such as that described by Larionov et al 9 1997 
Proc. Natl Acad. Sci. USA, 94: 7384-7. Other similar vector systems may also be used. To 
recover the entire genomic copy of the Zmaxl gene, the plasmid containing the two "hooks" 
is linearized with a restriction endonuclease or is produced by another method such as PCR. 

15 This linear DNA fragment is introduced into yeast cells along with human genomic DNA. 

Typically, the yeast Saccharomyces cerevisiae is used as a host cell, although Kourina et al 9 
1998 Genome Res. 8: 666-72 have reported using chicken host cells as well. During and after 
the process of transformation, the endogenous host cell converts the linear plasmid to a circle 
by a recombination event whereby the region of the human genomic DNA homologous to the 

20 "hooks" is inserted into the plasmid. This plasmid can be recovered and analyzed by methods 
well known to one skilled in the art. Obviously, the specificity for this reaction requires the 
host cell machinery to recognize sequences similar to the "hooks" present in the linear 
fragment. However, 100% sequence identity is not required, as shown by Kouprina et al, 
1998 Genomics 53: 21-8, where the author describes using degenerate repeated sequences 

25 common in the human genome to recover fragments of human DNA from a rodent/human 
hybrid cell line. 

In another example, only one "hook" is required, as described by Larionov et ah, 1998 
Proa Natl Acad. ScL USA, 95: 4469-74. For this type of experiment, termed "radial TAR 
cloning," the other region of sequence similarity to drive the recombination is derived from a 
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repeated sequence from the genome. In this way, regions of DNA adjacent to the Zmaxl 
gene coding region can be recovered and examined for alterations that may affect function, 

25. Methods of Use; Genomic Screening 

5 The use of polymorphic genetic markers linked to the HBM gene, or to Zmaxl 

(LRP5) or to LRP6 is very useful in predicting susceptibility to osteoporosis or other bone 
diseases. Roller et aL, 1998 Amer. J. Bone Min. Res. 13: 1903-8 have demonstrated that the 
use of polymorphic genetic markers is useful for linkage analysis. Similarly, the 
identification of polymorphic genetic markers within the high bone mass gene will allow the 

10 identification of specific allelic variants that are in linkage disequilibrium with other genetic 
lesions that affect bone development. Using the DNA sequence from the BACs, a 
dinucleotide CAn repeat was identified and two unique PCR primers that will amplify the 
genomic DNA containing this repeat were designed, as shown below: 

B200E21C16JL: GAGAGGCTATATCCCTGGGC (SEQ ID NO: 38) 

15 B200E21C16_R: ACAGCACGTGTTTAAAGGGG (SEQ ID NO: 39) 

and used in the genetic mapping study. 

This method has been used successfully by others skilled in the art (e.g., Sheffield et 
aL, 1995 Genet. 9 4:1837-44; LeBlanc-Straceski et aL, 1994 Genomics, 19: 341-9; Chen et aL, 
1995 Genomics, 25:1-8). Use of these reagents with populations or individuals will predict 

20 their risk for osteoporosis. Similarly, single nucleotide polymorphisms (SNPs), such as those 
shown in Table 4 above, can be used as well to predict risk for developing bone diseases or 
resistance to osteoporosis in the case of the HBM gene. 

26. Methods of Use: Modulators of T issue Calcification 

25 The calcification of tissues in the human body is well documented. Towler et aL 

(1998 J. Biol. Chern. 273: 30427-34) demonstrated that several proteins known to regulate 
calcification of the developing skull in a model system are expressed in calcified aorta. The 
expression of Msx2, a gene transcribed in osteoprogenitor cells, in calcified vascular tissue 
indicates that genes which are important in bone development are involved in calcification of 
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other tissues. 

Treatment with HBM protein, HBM-like proteins and polypeptides, agonists of LRP5 
and LRP6 and HBM or antagonists of Dkk-1 are likely to ameliorate calcification (such as the 
vasculature, dentin and bone of the skull viscera) due to its demonstrated effect on bone 
mineral density. In experimental systems where tissue calcification is demonstrated, the 
over-expression or repression of Zmaxl(LRP5) activity permits the identification of 
molecules that are directly regulated by the Zmaxl gene. These genes are potential targets for 
therapeutics aimed at modulating tissue calcification. For example, an animal, such as the 
LDLR -/-, mouse is fed a high fat diet and is observed to demonstrate expression of markers 
of tissue calcification, including Zmaxl. These animals are then treated with antibodies to 
Zmaxl or HBM protein, antisense oligonucleotides directed against Zmaxl or HBM cDNA, 
or with compounds known to bind the Zmaxl or HBM protein or its binding partner or 
ligand. RNA or proteins are extracted from the vascular tissue and the relative expression 
levels of the genes expressed in the tissue are determined by methods well known in the art. 
Genes that are regulated in the tissue are potential therapeutic targets for pharmaceutical 
development as modulators of tissue calcification. 

The nucleic acids, proteins, peptides, amino acids, small molecules or other 
pharmaceutical^/ useful compounds of the present invention that are to be given to an 
individual may be administered in the form of a composition with a pharmaceutical^/ 
acceptable carrier, excipient or diluent, which are well known in the art. The individual may 
be a mammal or a bird, preferably a human, a rat, a mouse or bird. Such compositions may 
be administered to an individual in a pharmaceutical^/ effective amount. The amount 
administered will vary depending on the condition being treated and the patient being treated. 
The compositions may be administered alone or in combination with other treatments. 

27 - Methods to Identify Agents that Modulate the Expression of a NuHmo \ r jd 
Encoding t h e T>kk a n d/o r T . RP 5 Proteins and/or THrfr interacting p roving 

Another embodiment of the present invention provides methods for identifying agents 
that modulate the expression of a nucleic acid encoding Dkk, which is part of the Wnt 
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pathway and that interacts with LRP5, LRP6 and to much lesser extent to HBM and its 
variants. Such assays may utilize any available means of monitoring for changes in the 
expression level of the nucleic acids of the invention. As used herein, an agent is said to 
modulate the expression of Dkk, if it is capable of up- or down-regulating expression of the 
5 nucleic acid in a cell (e.£., mRNA). 

In one assay format, cell lines that contain reporter gene fusions between the nucleic 
acid encoding Dkk (or proteins which modulate the activity of Dkk) and any assayable fusion 
partner may be prepared. Numerous assayable fusion partners are known and readily 
available, including but not limited to the firefly luciferase gene and the gene encoding 

10 chloramphenicol acetyltransferase (Alam et aL, 1990 Anal Biochem. 188: 245-54). Cell lines 
containing the reporter gene fusions are then exposed to the agent to be tested under 
appropriate conditions and time. Differential expression of the reporter gene between 
samples exposed to the agent and control samples identifies agents which modulate the 
expression of a nucleic acid encoding Dkk or other protein which modulates Dkk activity. 

1 5 Such assays can similarly be used to determine whether LRP5 and even LRP6 activity is 
modulated by regulating Dkk activity. This can also be performed with the HBM variants. 

Additional assay formats may be used to monitor the ability of the agent(s) to 
modulate the expression of a nucleic acid encoding Dkk, alone or Dkk and LRP5, and/or Dkk 
interacting proteins such as those identified in Figure 3 1 . For instance, mRNA expression 

20 may be monitored directly by hybridization to the nucleic acids of the invention. Cell lines 
are exposed to the agent to be tested under appropriate conditions and time and total RNA or 
mRNA is isolated by standard procedures such those disclosed in Sambrook et al (1989); 
Ausubel et al, Current Protocols in Molecular Biology (Greene Publishing Co., NY, 1995); 
Maniatis et al, Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, 

25 Cold Spring Harbor, NY, 1982); and Short Prot ocols in Molecular Biol ogy: A Compendium 
of Methods from Current Protocols in Molecular Biology (Frederick M. Ausubel et aL, April 
1999). 

Probes to detect differences in RNA expression levels between cells exposed to the 
agent and control cells may be prepared from the nucleic acids of the invention. It is 
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preferable, but not necessary, to design probes which hybridize only with target nucleic acids 
under conditions of high stringency. Only highly complementary nucleic acid hybrids form 
under conditions of high stringency. Accordingly, the stringency of the assay conditions 
determines the amount of complementarity which should exist between two nucleic acid 
strands in order to form a hybrid. Stringency should be chosen to maximize the difference in 
stability between the probe:target hybrid and potential probemon-target hybrids. 

Probes may be designed from the nucleic acids of the invention through methods 
known in the art. For instance, the G+C content of the probe and the probe length can affect 
probe binding to its target sequence. Methods to optimize probe specificity are commonly 
available. See for example, Sambrook et al. (1989) or Ausubel et al. (Current Protocols m 
Molecular Biology, Greene Publishing Co., NY, 1995). 

Hybridization conditions are modified using known methods, such as those described 
by Sambrook et al. (1989) and Ausubel et al. (1995), as suitable for each probe. 
Hybridization of total cellular RNA or RNA enriched for polyA RNA can be accomplished in 
any available format. For instance, total cellular RNA or RNA enriched for polyA RNA can 
be affixed to a solid support and the solid support exposed to at least one probe comprising at 
least one, or part of one of the nucleic acid sequences of the invention under conditions in 
which the probe will specifically hybridize. Alternatively, nucleic acid fragments comprising 
at least one, or part of one of the sequences of the invention can be affixed to a solid support, 
such as a porous glass wafer. The glass or silica wafer can then be exposed to total cellular 
RNA or polyA RNA from a sample under conditions in which the affixed sequences will 
specifically hybridize. Such glass wafers and hybridization methods are widely available, for 
example, those disclosed by Beattie (WO 95/1 1755). By examining for the ability of a given 
probe to specifically hybridize to an RNA sample from an untreated cell population and from 
a cell population exposed to the agent, agents which up- or down-regulate the expression of a 
nucleic acid encoding Dkk, a Dkk interacting protein, and/or LRP5 can be identified. 

Microarray technology and transcriptional profiling are examples of methods which 
can be used to analyze the impact of putative Dkk or Dkk interacting protein modulating 
compounds. For transcriptional profiling, mRNA from cells exposed in vivo to a potential 

188 



BNSDOCID: <WO 02092000A2_I_> 



WO 02/092000 PCT/US02/14877 



Dkk modulating agent, such as the Dkk interacting proteins identified in the present invention 
(e.g., those identified in Fig. 31), agents which modulate Dkk interacting proteins, and 
mRNA from the same type of cells that were not exposed to the agent could be reverse 
transcribed and hybridized to a chip containing DNA from numerous genes, to thereby 
5 compare the expression of genes in cells treated and not treated with the agent. If, for 

example a putative Dkk modulating agent down-regulates the expression of Dkk in the cells, 
then use of the agent may be undesirable in certain patient populations. For additional 
methods of transcriptional profiling and the use of microarrays, refer to, for example, U.S. 
Patent No. 6,124,120 issued to Lizardi (2000). 

1 0 Additional methods for screening the impact of Dkk and Dkk interacting protein 

modulating compounds or the impact of Dkk or Dkk interacting proteins on modulation of 
LRP5, LRP6, HBM, HBM variants or the Wnt pathway include the use of TaqMan® PCR, 
conventional reverse transcriptase PCR (RT-PCR), changes in downstream surrogate markers 
(i.e., Wnt responsive genes), and anti-Dkk Western blots for protein detection. Other 

1 5 methods would be readily apparent to the artisan of ordinary skill. 

28. Methods to Identify Agents that Modulate at Least One Activity of Dkk, a Dkk 
Interacting Protein, or I.RPS/LRPfi/HBM/HBM-like 

Another embodiment of the present invention provides methods for identifying agents 
20 that modulate at least one activity of Dkk, Dkk interacting proteins, and/or 

LRP5/LRP6/HBM/HBM-like proteins or preferably which specifically modulate an activity 
of a Dkk/Dkk interacting protein complex or an LRP5 (or LRP6/HBM/HBM-like)/Dkk 
complex, or a biologically active fragment of Dkk (e.g., comprising the domain which binds 
LRP5/LRP6/HBM/HBM-like) or a Dkk interacting protein complex. Such methods or 
25 assays may utilize any means of monitoring or detecting the desired activity as would be 
known in the art (See, e.g., Wu et al., 2000 Curr. Biol 10:161 1-4; Fedi et al, 199 J. Biol. 
Chem. 274:19465-72; Grotewold et a!., 1999 Mech. Dev. 89:151-3; Shibata et al, 2000 
Mech. Dev. 96: 243-6 ; Wang et al, 2000 Oncogene 19: 1843-8; and Glinka et al, 1998 
Nature 391 : 357-62). Potential agents which modulate Dkk include, for example, p53, the 
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tumor suppressor protein, which can induce Dkk-1. Damage to DNA has also been observed 
to up-regulate Dkk-1 expression via a stabilization and activation of p53 (Wang et al, 2000 
Oncogene 19: 1843-48); and, Shou et al, 2002 Oncogene 21: 878-89). Additionally, Fedi et 
al. (1999) purportedly showed that Dkk-1 can block the Wnt2-induced oncogenic 
transformation of NIH-3T3 cells. Furthermore, it has been suggested that Dkk expression 
may be modulated by BMP signaling in the developing skeleton (Mukhopadhyay et al, 2001 
Dev. Cell. 1 : 423-34; and Grotewold et al, 2002 EMBO J. 21 : 966-75). Grotewald et al 
additionally describe altered Dkk expression levels in response to stress signals including UV 
irradiation and other genotoxic stimuli. They propose that Dkk expression is pro-apoptotic. 
In the HBMMTIC animals described herein, a reduced osteoblast apoptosis effect was 
observed. Thus, HBM and HBM like variants may control/alter Dkk's role in programmed 
cell death. Other agents which potentially modulate Dkk activity include the Dkk interacting 
proteins identified in Fig. 31. 

In one embodiment, the relative amounts of Dkk or a Dkk interacting protein of a cell 
population that has been exposed to the agent to be tested is compared to an un-exposed 
control cell population. Antibodies can be used to monitor the differential expression of the 
protein in the different cell populations. Cell lines or populations are exposed to the agent to 
be tested under appropriate conditions and time. Cellular lysates may be prepared from the 
exposed cell line or population and a control, unexposed cell line or population. The cellular 
lysates are then analyzed with the probe, as would be known in the art. See, e.g., Ed Harlow 
and David Lane, Antibodies: A Laboratory Mar> 1ia | (Cold Spring Harbor, NY, 1988) and Ed 
Harlow and David Lane, Usin g Antibodies: A T ,ahoratory Manual (Cold Spring Harbor, NY 
1998). 

For example, N- and C- terminal fragments of Dkk can be expressed in bacteria and 
used to search for proteins which bind to these fragments. Fusion proteins, such as His-tag or 
GST fusion to the N- or C-terminal regions of Dkk (or to biologically active domains of Dkk- 
1) or a whole Dkk protein can be prepared. These fusion proteins can be coupled to, for 
example, Talon or Glutathione-Sepharose beads and then probed with cell lysates to identify 
molecules which bind to Dkk. Prior to lysis, the cells may be treated with purified Wnt 
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proteins, RNA, or drugs which may modulate Wnt signaling or proteins that interact with 
downstream elements of the Wnt pathway. Lysate proteins binding to the fusion proteins can 
be resolved by SDS-PAGE, isolated and identified by, for example protein sequencing or 
mass spectroscopy, as is known in the art. See, e.g., Protein Purification Applications: A 
5 Practical Approach (Simon Roe, ed., 2 nd ed. Oxford Univ. Press, 2001) and "Guide to Protein 
Purification" in Meth. Enzymology vol. 182 (Academic Press, 1997). 

The activity of Dkk, a Dkk interacting protein, or a complex of Dkk with 
LRP5/LRP6/HBM/HBM-like may be affected by compounds which modulate the interaction 
between Dkk and a Dkk interacting protein (such as those shown in Fig. 31) and/or Dkk and 

10 LRP5/LRP6/HBM/HBM-like. The present invention provides methods and research tools 

for the discovery and characterization of these compounds. The interaction between Dkk and 
a Dkk interacting protein and/or Dkk and LRP5/6/HBM/HBM-like may be monitored in vivo 
and in vitro. Compounds which modulate the stability of a Dkk - LRP5/LRP6/HBM/HBM- 
like complex are potential therapeutic compounds. Example in vitro methods include: 

1 5 Binding LRP5/6/HBM/HBM-like, Dkk, or a Dkk interacting protein to a sensor chip 

designed for an instrument such are made by Biacore (Uppsala, Sweden) for the performance 
of an plasmon resonance spectroscopy observation. In this method, the chip with one of Dkk, 
a Dkk interacting protein, or LRP5/6 is first exposed to the other under conditions which 
permit them to form the complex. A test compound is then introduced and the output signal 

20 of the instrument provides an indication of any effect exerted by the test compound. By this 
method, compounds may be rapidly screened. Another, in vitro, method is exemplified by 
the SAR-by-NMR methods (Shuker et al. 9 Science. 274:1531-4 (1996)). Briefly, a Dkk-1 
binding domain and/or LRP 5 or 6 LBD are expressed and purified as 15 N labeled protein by 
expression in labeled media. The labeled protein(s) are allowed to form the complex in 

25 solution in an NMR sample tube. The heteronuclear correlation spectrum in the presence and 
absence of a test compound provides data at the level of individual residues with regard to 
interactions with the test compound and changes at the protein-protein interface of the 
complex. One of skill in the art knows of many other protocols, e.g. affinity capillary 
electrophoresis (Okun et al, 2001 J. Biol. Chem. 276: 1057-62; Vergnon et al, 1999 
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Methods, 19: 270-7), fluorescence spectroscopy, electron paramagnetic resonance, etc. which 
can monitor the modulation of a complex and/or measure binding affinities for complex 
formation. 

In vitro protocols for monitoring the modulation of a DkJc/LRP5/LRP6/HBM/HBM- 
like complex include the yeast two hybrid protocol. The yeast two hybrid method may be 
used to monitor the modulation of a complex in vivo by monitoring the expression of genes 
activated by the formation of a complex of fusion proteins of Dkk and LRP ligand binding 
domains. Nucleic acids according to the invention which encode the interacting Dkk and 
LRP LBD domains are incorporated into bait and prey plasmids. The Y2H protocol is 
performed in the presence of one or more test compounds. The modulation of the complex is 
observed by a change in expression of the complex activated gene. It will be appreciated by 
one skilled in the art that test compounds can be added to the assay directly or, in the case of 
proteins, can be coexpressed in the yeast with the bait and prey compounds. Similarly, 
fusion proteins of Dkk and Dkk interacting proteins can also be used in a Y2H screen to 
identify other proteins which modulate the Dkk/Dkk interacting protein complex. 

Assay protocols such as these may be used in methods to screen for compounds, 
drugs, treatments which modulate the Dkk/Dkk interacting protein and/or Dkk/LRP5/6 
complex, whether such modulation occurs by competitive binding, or by altering the 
structure of either LRP 5/6 or Dkk at the binding site, or by stabilizing or destablizing the 
protein-protein interface. It may be anticipated that peptide aptamers may competitively 
bind, although induction of an altered binding site structure by steric effects is also possible. 

28.1 Antibodies and Antibo dy Fragments 

Polyclonal and monoclonal antibodies and fragments of these antibodies which bind 
to Dkk or LRP5/LRP6/HBM/HBM-like can be prepared as would be known in the art. For 
example, suitable host animals can be immunized using appropriate immunization protocols 
and the peptides, polypeptides or proteins of the invention. Peptides for use in immunization 
are typically about 8-40 residues long. If necessary or desired, the polypeptide immunogens 
can be conjugated to suitable carriers. Methods for preparing immunogenic conjugates with 
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carriers such as bovine serum albumin (BSA), keyhole limpet hemocyanin (KLH), or other 
carrier proteins are well known in the art (See, Harlow et aL 9 1988). In some circumstances, 
direct conjugation using, for example, carbodiimide reagents, may be effective; in other 
instances linking reagents such as those supplied by Pierce Chemical Co., Rockford, IL, may 
5 be desirable to provide accessibility to the polypeptide or hapten. The hapten peptides can be 
extended at either the amino or carboxy terminus with a cysteine residue or interspersed with 
cysteine residues, for example, to facilitate linking to a carrier. Administration of the 
immunogens is conducted generally by injection over a suitable time period and with use of 
suitable adjuvants, as is generally understood in the art. During the immunization schedule, 

10 titers of antibodies are taken to determine adequacy of antibody formation. 

Anti-peptide antibodies can be generated using synthetic peptides, for example, the 
peptides derived from the sequence of any Dkk, including Dkk-1, or 
LRP5/LRP6/HBM/HBM-like. Synthetic peptides can be as small as 2-3 amino acids in 
length, but are preferably at least 3, 5, 10, or 15 or more amino acid residues long. Such 

15 peptides can be determined using programs such as DNAStar. The peptides are coupled to .*■- 
KLH using standard methods and can be immunized into animals such as rabbits. Polyclonal 
anti-Dkk or anti-LRP5/LRP6/HBM/HBM-like peptide antibodies can then be purified, for 
example using Actigel beads containing the covalently bound peptide. 

While the polyclonal antisera produced in this way may be satisfactory for some 

20 applications, for pharmaceutical compositions, use of monoclonal preparations is preferred. 
Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared 
using the standard method of Kohler and Milstein or modifications which effect 
immortalization of lymphocytes or spleen cells, as is generally known (See, e.g., Harlow et 
aL, 1988 and 1998). The immortalized cell lines secreting the desired antibodies can be 

25 screened by immunoassay in which the antigen is the peptide hapten, polypeptide or protein. 
When the appropriate immortalized cell culture secreting the desired antibody is identified, 
the cells can be cultured either in vitro or by production in ascites fluid. 

The desired monoclonal antibodies are then recovered from the culture supernatant or 
from the ascites supernatant. Fragments of the monoclonal antibodies which contain the 
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immunologically significant portion can be used as agonists or antagonists of Dklc activity. 
Use of immunologically reactive fragments, such as the Fab, scFV, Fab f , of F(ab') 2 fragments 
are often preferable, especially in a therapeutic context, as these fragments are generally less 
immunogenic than the whole immunoglobulin. 
5 The antibodies or fragments may also be produced, using current technology, by 

recombinant means. Regions that bind specifically to the desired regions of Dkk or 
LRP5/LRP6/HBM/HBM-like can also be produced in the context of chimeras with multiple 
species origin. Antibody reagents so created are contemplated for use diagnostically or as 
stimulants or inhibitors of Dkk activity. 
10 In one embodiment, antibodies against Dkk, bind Dkk with high affinity, i.e., ranging 

from 10~ 5 to 10* 9 M. Preferably, the anti-Dkk antibody will comprise a chimeric, primate, 
Primatized®, human or humanized antibody. Also, the invention embraces the use of 
antibody fragments, e.g., Fab's, Fv's, Fab's, F(ab) 2 , and aggregates thereof. 

Another embodiment contemplates chimeric antibodies which recognize Dkk or 
15 LRP5/LRP6/HBM/HBM-like. A chimeric antibody is intended to refer to an antibody with 
non-human variable regions and human constant regions, most typically rodent variable 
regions and human constant regions. 

A "primatized® antibody" refers to an antibody with primate variable regions, e.g., 
CDR's, and human constant regions. Preferably, such primate variable regions are derived 
20 from an Old World monkey. 

A "humanized antibody" refers to an antibody with substantially human framework 
and constant regions, and non-human complementarity-determining regions (CDRs). 
"Substantially" refers to the fact that humanized antibodies typically retain at least several 
donor framework residues {i.e., of non-human parent antibody from which CDRs are 
25 derived). 

Methods for producing chimeric, primate, primatized®, humanized and human 
antibodies are well known in the art. See, e.g., U.S. Patent 5,530,101, issued to Queen et air, 
U.S. Patent 5,225,539, issued to Winter et al; U.S. Patents 4,816,397 and 4,816,567, issued 
to Boss et al and Cabilly et al respectively, all of which are incorporated by reference in 
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their entirety. 

The selection of human constant regions may be significant to the therapeutic efficacy 
of the subject anti-Dkk or LRP5fLKP6/HBM/HBMAik& antibody. In a preferred 
embodiment, the subject anti-Dkk or LRP5/LRP6/HBM/HBM-like antibody will comprise 
5 human, gamma 1, or gamma 3 constant regions and, more preferably, human gamma 1 
constant regions. 

Methods for making human antibodies are also known and include, by way of 
example, production in SCID mice, and in vitro immunization. 

The subject anti-Dkk or LRP5/LRP6/EIBM/HBM-like antibodies can be administered 
10 by various routes of administration, typically parenteral. This is intended to include 
intravenous, intramuscular, subcutaneous, rectal, vaginal, and administration with 
intravenous infusion being preferred. 

The anti-Dkk or LRP5/LRP6/HBM/HBM-like antibody will be formulated for 
therapeutic usage by standard methods, e.g., by addition of pharmaceutical^ acceptable 
15 buffers, e.g., sterile saline, sterile buffered water, propylene glycol, and combinations thereof. 

Effective dosages will depend on the specific antibody, condition of the patient, age, 
weight, or any other treatments, among other factors. Typically effective dosages will range 
from about 0.001 to about 30 mg/kg body weight, more preferably from about 0.01 to 25 
mg/kg body weight, and most preferably from about 0.1 to about 20 mg/kg body weight. 
20 Such administration may be effected by various protocols, e.g., weekly, bi-weekly, or 

monthly, depending on the dosage administered and patient response. Also, it may be 
desirable to combine such administration with other treatments. 

Antibodies to Dkk-1 interacting proteins, such as those identified in Fig. 3 1, are also 
contemplated according to the present invention, and can be used similarly to the Dkk-1 
25 antibodies mentioned in the above methodology. 

The antibodies of the present invention can be utilized in experimental screening, as 
diagnostic reagents, and in therapeutic compositions. 

28.2 Chemical Librarie s 
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Agents that are assayed by these methods can be randomly selected or rationally 
selected or designed. As used herein, an agent is said to be randomly selected when the agent 
is chosen randomly without considering the specific sequences involved in the association of 
Dkk-1 alone, Dkk-1 interacting proteins alone, or with their associated substrates, binding 
partners, etc. An example of randomly selected agents is the use of a chemical library or a 
peptide combinatorial library, or a growth broth of an organism. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. A skilled artisan can readily recognize that 
there is no limit as to the structural nature of the agents of the present invention. 

28.3 Peptide Synthesis 

The peptide agents of the invention can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantly using standard recombinant 
production systems. The production of polypeptides using solid phase peptide synthesis is 
necessitated if non-nucleic acid-encoded amino acids are to be included. 



29. Uses for Agents that Modul ate at Le a st O n e Acti vity of Dkk, * Pkk Interacting 
Protein, a Dkk/Dk k Interacting Protein Comp lex or a Dkk/I >KP 5 or Dkk/T/RPfi 
Co mplex 

The proteins and nucleic acids of the invention, such as the proteins or polypeptides 
containing an amino acid sequence of LRP5, Dick, and Dkk interacting proteins are involved 
in bone mass modulation and lipid modulation of other Wnt pathway mediated activity. 
Agents that modulate {i.e., up and down-regulate) the expression of Dkk or Dkk interacting 
proteins, or agents, such as agonists and antagonists respectively, of at least one activity of 
Dkk or a Dkk interacting protein may be used to modulate biological and pathologic 
processes associated with the function and activity of Dkk or a Dkk interacting protein. 

As used herein, a subject can be preferably any mammal, so long as the mammal is in 
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need of modulation of a pathological or biological process modulated by a protein of the 
invention. The term "mammal" means an individual belonging to the class Mammalia. The 
invention is particularly useful in the treatment of human subjects. 

As used herein, a biological or pathological process modulated by Dkk or a Dkk 
5 interacting protein may include binding of Dkk to a Dkk interacting protein, Dkk to LRP5 or 
LRP6 or release therefrom, inhibiting or activating Dkk or a Dkk interacting protein mRNA 
synthesis or inhibiting Dkk or Dkk interacting protein modulated inhibition of LRP5 or LRP6 
mediated Wnt signaling. Further bone-related markers may be observed such as alkaline 
phosphatase activity, osteocalcin production, or mineralization. 

10 Pathological processes refer to a category of biological processes which produce a 

deleterious effect. For example, expression or up-regulation of expression of LRP5 or LRP6 
and/or Dkk and/or a Dkk interacting protein may be associated with certain diseases or 
pathological conditions. As used herein, an agent is said to modulate a pathological process 
when the agent statistically significantly (p < 0.05) alters the process from its base level in 

15 the subject. For example, the agent may reduce the degree or severity of the process 

mediated by that protein in the subject to which the agent was administered. For instance, a 
disease or pathological condition may be prevented, or disease progression modulated by the 
administration of agents which reduce or modulate in some way the expression or at least one 
activity of a protein of the invention. 

20 As LRP5/6 and Dkk are involved both directly and indirectly in bone mass 

modulation, one embodiment of this invention is to use Dkk or Dkk interacting protein 
expression as a method of diagnosing a bone condition or disease. Certain markers are 
associated with specific Wnt signaling conditions (e.g., TCF/LEF activation). Diagnostic 
tests for bone conditions may include the steps of testing a sample or an extract thereof for 

25 the presence of Dkk or Dkk interacting protein nucleic acids (i.e., DNA or RNA), oligomers 
or fragments thereof or protein products of TCF/LEF regulated expression. For example, 
standard in situ hybridization or other imaging techniques can be utilized to observe products 
of Wnt signaling. Other diagnostic techniques, as described herein, would also be useful as 
would be apparent to the skilled artisan (e.g., a serum marker). 
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This invention also relates to methods of modulating bone development or bone loss 
conditions. Inhibition of bone loss may be achieved by inhibiting or modulating changes in 
the LRP5/6 mediated Wnt signaling pathway. For example, absence of LRP5 activity may 
be associated with low bone mass. Increased activity LRP5 may be associated with high 
bone mass. Therefore, modulation of LRP5 activity will in turn modulate bone development. 
Modulation of the Dkk/LRP5/6 or Dkk/Dkk interacting protein complex via agonists and 
antagonists is one embodiment of a method to regulate bone development. Such modulation 
of bone development can result from inhibition of the activity of, for example, a 
Dkk/LRP(5/6) protein complex, a Dkk/Dkk interacting protein complex, upregulated 
transcription of the LRP5 gene or inhibited transcription or translation of Dkk or Dkk 
interacting protein mRNA. 

The agents of the present invention can be provided alone, or in combination with 
other agents that modulate a particular pathological process. As used herein, two agents are 
said to be administered in combination when the two agents are administered simultaneously 
or are administered independently in a fashion such that the agents will act at the same time. 

The agents of the present invention can be administered via parenteral, subcutaneous 
(sc), intravenous (iv), intramuscular (im), intraperitoneal (ip), transdermal or buccal routes. 
Alternatively, or concurrently, administration may be by the oral route. The dosage 
administered will be dependent upon the age, health, and weight of the recipient, kind of 
concurrent treatment, if any, frequency of treatment, and the nature of the effect desired. 

The present invention further provides compositions containing one or more agents 
which modulate expression or at least one activity of a protein of the invention. While 
individual needs vary, determination of optimal ranges of effective amounts of each 
component is within the skill of the art. Typical dosages of the active agent which mediate 
Dkk or Dkk interacting protein activity comprise from about 0.0001 to about 50 mg/kg body 
weight. The preferred dosages comprise from about 0.001 to about 50 mg/kg body weight. 
The most preferred dosages comprise from about 0.1 to about 1 mg/kg body weight. In an 
average human of 70 kg, the range would be from about 7 ug to about 3.5 g, with a preferred 
range of about 0.5 mg to about 5 mg. 
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In addition to the pharmacologically active agent, the compositions of the present 
invention may contain suitable pharmaceutically acceptable carriers comprising excipients 
and auxiliaries which facilitate processing of the active compounds into preparations which 
can be used pharmaceutically for delivery to the site of action. Suitable formulations for 
5 parenteral administration include aqueous solutions of the active compounds in water-soluble 
form, for example, water-soluble salts. In addition, suspensions of the active compounds as 
appropriate oily injection suspensions may be administered. Suitable lipophilic solvents or 
vehicles include fatty oils, for example, sesame oil, or synthetic fatty acid esters, {e.g., ethyl 
oleate or triglycerides). Aqueous injection suspensions may contain substances which 
10 increase the viscosity of the suspension include, for example, sodium carboxymethyl 

cellulose, sorbitol and/or dextran. Optionally, the suspension may also contain stabilizers. 
Liposomes and other non-viral vectors can also be used to encapsulate the agent for delivery 
into the cell. 

The pharmaceutical formulation for systemic administration according to the 
15 invention may be formulated for enteral, parenteral, or topical (top) administration. Indeed, 
all three types of formulations may be used simultaneously to achieve systemic 
administration of the active ingredient. 

Suitable formulations for oral administration include hard or soft gelatin capsules, 
pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and 
20 controlled release forms thereof. 

Potentially, any compound which binds Dkk or a Dkk interacting protein or 
modulates the Dkk/LRP5 or Dkk/LRP6 or Dkk/Dkk interacting protein complex may be a 
therapeutic compound. In one embodiment of the invention, a peptide or nucleic acid 
aptamer according to the invention is used in a therapeutic composition. Such compositions 
25 may comprise an aptamer, or a LRP5 or LRP6 fragment unmodified or modified. In another 
embodiment, the therapeutic compound comprises a Dkk-1 interacting protein, or 
biologically active fragment thereof. 

Nucleic acid aptamers have been used in compositions for example by chemical 
bonding to a carrier molecule such as polyethylene glycol (PEG) which may facilitate uptake 
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or stabilize the aptamer. A di-alkylgylcerol moiety attached to an RNA will embed the 
aptamer in liposomes, thus stabilizing the compound. Incorporating chemical substitutions 
(i.e. changing the 2'OH group of ribose to a 2TSEH in RNA confers ribonuclease resistance) 
and capping, etc. can prevent breakdown. Several such techniques are discussed for RNA 
5 aptamers in Brody and Gold (Rev. Mol. Biol. 74:3-13, 2000). 

Peptide aptamers may by used in therapeutic applications by the introduction of an 
expression vector directing aptamer expression into the affected tissue such as for example by 
retroviral delivery, by encapsulating the DNA in a delivery complex or simple by naked 
DNA injection. Or, the aptamer itself or a synthetic analog may be used directly as a drug. 
10 Encapsulation in polymers and lipids may assist in delivery. The use of peptide aptamers as 
therapeutic and diagnostic agents is reviewed by Hoppe-Syler and Butz (J. Mol. Med. 
78:426-430 (2000)). 

In another aspect of the invention. The structure of a constrained peptide aptamer of 
the invention may be determined such as by NMR or X-ray crystallography. (Cavanagh et 

15 a l > Protein NMR Spectroscopy: Principles and Emetics, Academic Press, 1996; Drenth, 

Principles of Protein X-Ray Crystallography, Springer Verlag, 1999) Preferably the structure 
is determined in complex with the target protein. A small molecule analog is then designed 
according to the positions of functional elements of the 3D structure of the aptamer. 
(Guidebook on Molecular Modeling in Drug Design, Cohen, Ed., Academic Press, 1996; 

20 Molecular Modeling and Drue D esim (Topics in Mo lecular anri Structural Biolog y), vinter 
and Gardner Eds., CRC Press, 1994) Thus the present invention provides a method for the 
design of effective and specific drugs which modulate the activity of Dkk, Dkk interacting 
proteins, Dkk/Dkk interacting protein complex and the Dkk/LRP complex. Small molecule 
mimetics of the peptide aptamers of the present invention are encompassed within the scope 

25 of the invention. 

In practicing the methods of mis invention, the compounds of this invention may be 
used alone or in combination, or in combination with other therapeutic or diagnostic agents. 
In certain preferred embodiments, the compounds of this invention may be co-administered 
along with other compounds typically prescribed for these conditions according to generally 
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accepted medical practice. For example, the compounds of this invention can be 
administered in combination with other therapeutic agents for the treatment of bone loss. 
Bone loss mediating agents include bone resorption inhibitors such as bisphosphonates (e.g., 
alendronic acid, clodronic acid, etidronic acid, pamidronic acid, risedronic acid and tiludronic 
5 acid), vitamin D and vitamin D analogs, cathepsin K inhibitors, hormonal agents (e.g., 
calcitonin and estrogen), and selective estrogen receptor modulators or SERMs (e.g., 
raloxifene). And bone forming agents such as parathyroid hormone (PTH) and bone 
morphogenetic proteins (BMP). 

Additionally contemplated are combinations of agents which regulate Dkk-1 and 
10 agents which regulate lipid levels such as HMG-CoA reductase inhibitors (i.e., statins such as 
Mevacor®, Lipitor® and other inhibitors such as Baycol®, Lescol®, Pravachol® and 
Zocor®), bile acid sequestrants (e.g., Colestid® and Welchol®), fibric acid derivatives 
(Atromid-S®, Lopid®, Tricor®), and nicotinic acid. 

The compounds of this invention can be utilized in vivo, ordinarily in vertebrates and 
15 preferably in mammals, such as humans, sheep, horses, cattle, pigs, dogs, cats, rats and mice, 
or in vitro. 

30. Peptide and Nucleotide Apta mers and Peptide Aptamer Mimetics 

Another embodiment contemplates the use of peptide and nucleotide aptamer 
20 technology to screen for agents which interact with Dfck, which block Dkk from interacting 
with LRP5, LRP6, HBM, or HBM-like molecules, or which block any other Dkk ligand 
interaction, or which interact with Dkk interacting proteins, such as those shown in Figure 5. 
Peptide aptamers are molecules in which a variable peptide domain is displayed from a 
scaffold protein. Thioredoxin A (trxA) is commonly used for a scaffold. The peptide insert 
25 destroys the catalytic site of trxA. It is recognized that numerous proteins may also be used 
as scaffolding proteins to constrain and/or present a peptide aptamer. Other scaffold proteins 
that could display a constrained peptide aptamer could include staphylococcal nuclease, the 
protease inhibitor eglin C, the Streptomyces tendea alpha-amylase inhibitor Tendamistat, 
Spl, green fluorescent protein (GFP) (reviewed in Hoppe-Seyler et al, 2001 J. Steroid 
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Biochem Mol Biol 78: 105-1 1), and the SI nucleas from Staphyloccus or M13 for phage 
display.. Any molecule to which the aptamer could be anchored and presented in its 
bioactive conformation would be suitable. 

Aptamers can then specifically bind to a given target protein in vitro and in vivo and 
5 have the potential to selectively block the function of their target protein. Peptide aptamers 
are selected from randomized expression libraries on the basis of their in vivo binding 
capacity to the desired target protein. Briefly, a target protein (e.g., Dkk, a Dkk interacting 
protein, or LRP5/6) is linked to a heterologous DNA binding domain (BD) and expressed as 
bait in a yeast test strain. Concomitantly, a library coding for different peptides (e.g., 16- 

10 mers) of randomized sequence inserted in a scaffold protein sequence, which are linked to a 
heterologous transcriptional activation domain (AD) is expressed as prey. If a peptide binds 
to a target protein, a functional transcription factor is reconstituted, in which the BD and AD 
are bridged together by interacting proteins. This transcription factor is then able to activate 
the promoter of a marker gene which can be monitored by colorimetric enzymatic assays or 

15 by growth selection. Additional variation, methods of preparing and screening 

methodologies are described in, for example, Hoppe-Seyler et al, 2000 J. Mol Med. 78: 426- 
430. 

Nucleotide aptamers are described for example in Brody et al, 2000 Trends Mol 
Biotechnol 74: 5-13. Additional methods of making and using nucleotide aptamers include 

20 SELEX, z.e., Systematic Evolution of Ligands by Exponential Enrichment. SELEX is a 

process of isolating oligonucleotide ligands of a chosen target molecule (see Tuerk and Gold, 
Science 249:505-510 (1990); U.S. Pat Nos. 5,475,096, 5,595,877, and 5,660,985). SELEX, 
as described in Tuerk and Gold, involves admixing the target molecule with a pool of 
oligonucleotides {e.g., RNA) of diverse sequences; retaining complexes formed between the 

25 target and oligonucleotides; recovering the oligonucleotides bound to the target; reverse- 
transcribing the RNA into DNA; amplifying the DNA with polymerase chain reactions 
(PCR); transcribing the amplified DNA into RNA; and repeating the cycle with ever 
increasing binding stringency. Three enzymatic reactions are required for each cycle. It 
usually takes 12-15 cycles to isolate aptamers of high affinity and specificity to the target. 
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An aptamer is an oligonucleotide that is capable of binding to an intended target substance 
but not other molecules under the same conditions. 

In another reference, Bock et aL, 1990 Nature 355: 564-6, describe a different process 
from the SELEX method of Tuerk and Gold in that only one enzymatic reaction is required 
5 for each cycle (i.e., PCR) because the nucleic acid library in Bock's method is comprised of 
DNA instead of RNA. The identification and isolation of aptamers of high specificity and 
affinity with the method of Bock et aL still requires repeated cycles in a chromatographic 
column. 

Other nucleotide aptamer methods include those described by Conrad et aL, 1996 

10 Meth. EnzymoL 261: 336-367. Conrad et aL describe a variety of methods for isolating 

aptamers, all of which employ repeated cycles to enrich target-bound ligands and require a 
large amount of purified target molecules. More recently described methods of making and 
using nucleotide aptamers include, but are not limited to those described in U.S. Patent Nos. 
6,180,348; 6,051,388; 5,840,867; 5,780,610, 5,756,291 and 5,582,981. 

1 5 Potentially, any compound which binds Dkk or a Dkk interacting protein or 

modulates the Dkk/Dkk interacting protein or Dkk/LRP5, Dkk/LRP6, Dkk/HBM, or 
Dkk/HBM-like complex may be a therapeutic compound. In one embodiment of the 
invention, a peptide or nucleic acid aptamer according to the invention is used in a 
therapeutic composition. Such compositions may comprise an aptamer, or a LRP5 or LRP6 

20 fragment unmodified or modified. 

Nucleic acid aptamers have been used in compositions for example by chemical 
bonding to a carrier molecule such as polyethylene glycol (PEG) which may facilitate uptake 
or stabilize the aptamer. A di-alkylglycerol moiety attached to an RNA will embed the 
aptamer in liposomes, thus stabilizing the compound. Incorporating chemical substitutions 

25 (i.e., changing the 2 , -OH group of ribose to a 2'-NH in RNA confers ribonuclease resistance) 
and capping, etc. can prevent breakdown. Several such techniques are discussed for RNA 
aptamers in Brody et aL, 2000 Rev. MoL Biol. 74: 3-13. 

Peptide aptamers may by used in therapeutic applications by the introduction of an 
expression vector directing aptamer expression into the affected tissue such as for example by 
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retroviral delivery, by encapsulating the DNA in a delivery complex or simple by naked 
DNA injection. Or, the aptamer itself or a synthetic analog may be used directly as a drug. 
Encapsulation in polymers and lipids may assist in delivery. The use of peptide aptamers as 
therapeutic and diagnostic agents is reviewed by Hoppe-Syler et ah, 2000 J, Mol Med. 78: 
5 426-430. 

In another aspect of the invention, the structure of a constrained peptide aptamer of 
the invention may be determined such as by NMR or X-ray crystallography. (Cavanagh et al., 
Protein NMR Spectroscopy ; Principles and Practice, Academic Press, 1996; Drenth, 
Principles of Protein X-Ray Crystallography, Springer Verlag, 1999) Preferably the structure 

10 is determined in complex with the target protein. A small molecule analog is then designed 
according to the positions of functional elements of the 3D structure of the aptamer. 
(Guidebook on Molecular M odeling in Drug Design, Cohen, Ed., Academic Press, 1996; 
Molecu l ar Modeling and D rug Design (Topics in Molecular a n d Structural Biology), Vinter 
and Gardner Eds., CRC Press, 1994) Thus, a method is provided for the design of effective 

15 and specific drugs which modulate the activity of Dkk, Dkk interacting proteins, Dkk/Dkk 
interacting protein complex, and the Dkk/LRP complex. Small molecule mimics of the 
peptide aptamers of the present invention are also encompassed within the scope of the 
invention. 



20 EXAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
Standard techniques well known in the art or the techniques specifically described below 
were utilized. 

25 

Example 1 

The propositus was referred by her physicians to the Creighton Osteoporosis Center 
for evaluation of what appeared to be unusually dense bones. She was 1 8 years old and came 
to medical attention two years previous because of back pain, which was precipitated by an 
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auto accident in which the car in which she was riding as a passenger was struck from 
behind. Her only injury was soft tissue injury to her lower back that was manifested by pain 
and muscle tenderness. There was no evidence of fracture or subluxation on radiographs. 
The pain lasted for two years, although she was able to attend school full time. By the time 
5 she was seen in the Center, the pain was nearly resolved and she was back to her usual 

activities as a high school student. Physical exam revealed a normal healthy young woman 
standing 66 inches and weighing 128 pounds. Radiographs of the entire skeleton revealed 
dense looking bones with thick cortices. All bones of the skeleton were involved. Most 
importantly, the shapes of all the bones were entirely normal. The spinal BMC was 94.48 

10 grams in Ll-4, and the spinal BMD was 1.667 gm/cm 2 in Ll-4. BMD was 5.62 standard 

deviations (SD) above peak skeletal mass for women. These were measured by DXA using a 
Hologic 2000-. Her mother was then scanned and a lumbar spinal BMC of 58.05 grams and 
BMD of 1.500 gm/cm 2 were found. Her mother's values place her 4.12 SD above peak mass 
and 4.98 SD above her peers. Her mother was 51 years old, stood 65 inches and weighed 

15 140 pounds. Her mother was in excellent health with no history of musculoskeletal or other 
symptoms. Her father's lumbar BMC was 75.33 grams and his BMD was 1.118 gm/cm 2 . 
These values place him 0.25 SD above peak bone mass for males. He was in good health, 
stood 72 inches tall, and weighed 187 pounds. 

These clinical data suggested that the propositus inherited a trait from her mother, 

20 which resulted in very high bone mass, but an otherwise normal skeleton, and attention was 
focused on the maternal kindred. In U.S. Patent No. 5,691,153, twenty- two of these 
members had measurement of bone mass by DXA. In one case, the maternal grandfather of 
the propositus, was deceased, however, medical records, antemortem skeletal radiographs 
and a gall bladder specimen embedded in paraffin for DNA genotyping were obtained. His 

25 radiographs showed obvious extreme density of all of the bones available for examination 

including the femur and the spine, and he was included among the affected members. In this 
invention, the pedigree has been expanded to include 37 informative individuals. These 
additions are a significant improvement over the original kinship (Johnson et aL 9 Am. J. Hum. 
Genet, 60:1326-1332 (1997)) because, among the fourteen individuals added since the 
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original study, two individuals harbor key crossovers. X-linkage is ruled out by the presence 
of male-to-male transmission from individual 12 to 14 and 15. 

Example 2 

5 The present invention describes DNA sequences derived from two BAC clones from 

the HBM gene region, as evident in Table 7 below, which is an assembly of these clones. 
Clone b200e21-h (ATCC No. 980812; SEQ ID NOS: 10-1 1) was deposited at the American 
Type Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 201 10-2209 
U.S.A., on December 30, 1997. Clone b527dl2-h (ATCC No. 980720; SEQ ID NOS: 5-9) 
10 was deposited at the American Type Culture Collection (ATCC), 10801 University Blvd., 
Manassas, VA 201 10-2209 U.S.A., on October 2, 1998. These sequences are unique 
reagents that can be used by one skilled in the art to identify DNA probes for the Zmaxl 
gene, PCR primers to amplify the gene, nucleotide polymorphisms in the Zmaxl gene, or 
regulatory elements of the Zmaxl gene. 

15 

Example 3 

Veast-2 Hybrid screen for peptide aptamer sequences to Dkk-1 
Peptide aptamer library construction. A peptide aptamer library, Tpep, was 
constructed, which provides a means to identify chimeric proteins that bind to a protein target 
20 (or bait) of interest using classic yeast two hybrid (Y2H) assays. The Tpep library is a 

combinatorial aptamer library composed of constrained random peptides, expressed within 
the context of the disulfide loop of E. coli thioredoxin (trxA), and as C-termini fusion to the 
S. cerevisiae Gal4 activation domain. The Tpep library was generated using a restriction 
enzyme modified recombinant Y2H prey vector, pPC86 (Gibco), which contains the trxA 
25 scaffold protein. 

Generation of aptamer-encoding sequences. Aptamer-encoding sequences were 
produced as follows. DNA encoding random stretches of approximately sixteen amino acids 
surrounded by appropriate restriction sites were generated by semi-random oligonucleotide 
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synthesis. The synthetic oligonucleotides were PCR-amplified, restriction digested, and 
cloned into the permissive sites within the trxA scaffold protein. The cloning strategy was to 
insert the random oligonucleotide sequence is in-frame with the scaffold protein coding 
sequence, resulting in expression of a scaffold protein-aptamer chimera. The scaffold protein 
5 is itself in-frame with the activation domain of Gal4, within the pPC86 vector that is 
appropriate for the aptamer to be expressed and functional in a regular Y2H assay. 
Additional methods of preparing aptamers would be apparent to the skilled artisan. 

Generation of a permissive recombinant pPC86 vector containing the trxA coding 
sequence. First the RsrU restriction site located within the Gal4 activation domain of pPC86 

10 (Gibco) was eliminated by site-directed mutagenesis (Quickchange™ kit, Stratagene). The 

amino acid sequence of the Gal4 activation domain was unchanged by this modification. The 
strength of different control interactions was verified to be unchanged by the modification. 

Second, the E. coli trxA coding sequence was cloned into the Sail and NotI sites of 
the ifcsrll-modified pPC86. EcdKL and Spel sites were then introduced within the trxA RsrU 

15 site. The oligonucleotides encoding the peptide aptamers were cloned into the EcoRI and 
Spel sites of the resulting vector. 



TABLE 13 



Contig 


ATCC No. 


SEQID 
NO. 


Length 
(base pairs) 


b527dl2-h contig302G 


980720 


5 


3096 


b527dl2-h contig306G 


980720 


6 


26928 


b527dl2-h contig307G 


980720 


7 


29430 


b527dl2-h contig308G 


980720 


8 


33769 


b527dl2-h contig309G 


980720 


9 


72049 


b200e21-h contigl 


980812 


10 


8705 


b200e21-h contig4 


980812 


11 


66933 



Example^ 
Generation of antibodies 

30 In each of the following antibody-generating examples, the synthesis of these linear 

peptides is followed by injection into two New Zealand Rabbits. Subsequent boosts and 

207 



BNSDOCID: <WO 02092000 A2_L> 



WO 02/092000 



PCT/US02/14877 



bleeds are taken according to a standard ten-week protocol. The end-user receives back 5 

mgs of peptide, aliquots of pre-bleeds, roughly 80 ml of crude sera from each of the two 

rabbits and, and ELIS A titration data is obtained. 

Generation of LRPS Polymorphism-specific antibodies. Antibodies were generated 
5 to the following peptides to obtain antibodies which distinguish the HBM polymorphism 

versus wild-type LRP5/Zmax: MYWTDWVETPRIE (SEQ ID NO: ) (mutant peptide) and 

MYWTDWGETPRIE (SEQ ID NO: ) (wild-type peptide for negative selection). 

Immunofluorescence data confirmed that the antibody, after affinity purification, is specific 

for HBM and does not recognize LRPS (Fig. 32). 
10 Generation of LRPS Monospecific antibodies. LRPS monospecific polyclonal 

antibodies were generated to the following amino acid sequences of LRPS: Peptide 1 (a.a. 

265-277) - KRTGGKRKEELSA (SEQ ID NO: ), Peptide 2 (a.a. 1 178-1 194) - 

ERVEKTTGDKRTRIQGR (SEQ ID NO: ), and Peptide 3 (a.a. 1352-1375) - 

KQQCDSFPDCIDGSDE (SEQ ID NO: ). Immunofluorescence confirmed that the antibody 
15 generated detects LRPS. 

Generation of Dkk-1 monospecific polyclonal antibodies. Dkk-1 monospecific 

polyclonal antibodies were generated to the following amino acid sequences of Dkk-1 : 

Peptide 1 - GNKYQTIDNYQPYPC (SEQ ID NO: ), 

Peptide 2- LDGYSRRTTLSSKMYHTKGQEG (SEQ ID NO: ), 
20 Peptide 3 - RIQKDHHQASNSSRLHTCQRH (SEQ ID NO: ), 

Peptide 4 - RGEIEETITESFGND (SEQ ID NO: ), and 

Peptide 5 - EIFQRCGEGLSCRIQKD (SEQ ID NO: ) of human Dkk-1. Western Blots 
demonstrated that the antibodies generated against peptides 2 and 4 are specific toward Dkk- 
1. 

25 

Example 5 

Effects of exogenous Dkk-1 an Witt-mediated signaling in the Xenopns em bryo assay 
Xenopus embryos are an informative and well-established in vivo assay system to 
evaluate the modulation of Wnt signaling (McMahon et aL, 1989 Cell 58: 1075-84; Smith et 
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al, 1991 reviewed in Wodarz andNusse, 1998). 

Modification of the Wnt signaling pathway can be visualized by examining the 
embryos for a dorsalization phenotype (duplicated body axis) after RNA injection into the 
ventral blastomere at the 4- or 8-cell stage. On the molecular level, phenotypes can be 
5 analyzed by looking for expression of various marker genes in stage 10.5 embryos. Such 
markers would include general endoderm, mesoderm, and ectoderm markers as well as a 
variety of tissue-specific transcripts. 

Analysis can be done by RT-PCR/TaqMan® and can be done on whole embryo tissue 
or in a more restricted fashion (microdissection). Because this system is very flexible and 

10 rapid, by injecting combinations of transcripts, such as HBM and different Wnts or Wnt 
antagonists, the mechanism of HBM in the Wnt pathwaycan thereby be dissected. 
Furthermore, investigations are conducted to determine whether Zmax/LRP5 and HBM 
differentially modulate Wnt signaling either alone, or in combination with other components. 
Previous studies have demonstrated that LRP6 alone or LRP5 + Wnt5a were able to induce 

15 axis duplication (dorsalization) in this system (Tamai et aL, 2000 Nature 407: 530-35). 

Constructs for Xenopus Expression (Vector pCS2*). Constructs were prepared using 
the vector pCS2 + . DNA inserts were subcloned in the sense orientation with respect to the 
vector SP6 promoter. The pCS2 + vector contains an SV40 virus polyadenylation signal and 
T3 promoter sequence (for generation of antisense mRNA) downstream of the insert. 

20 Full length Zmax/LRP5 and HBM ORF cDNA : Insert cDNA was isolated from the 

full length cDNA retrovirus constructs (with optimized Kozak sequences) by Bglll-EcoKL 
digestion and subcloned into the BamHI-EcdBI sites of the pCS2 + vector. cDNAs encoding a 
HBM-like molecule could be subcloned into pCS2 + vectors and processed similarly by one of 
ordinary skill. 

25 Full length XWntS: This cDNA was PCR amplified from a Xenopus embryo cDNA 

library using oligos 1 14484 (SEQ ID NO: ) (S'-CAGTGAATTCACCATGCAAAACACC 
ACTTTGTTC-3') and 114487 (SEQ ID NO: ) (5 f -CAGTTGCGGCCGCTCATCTCCGGTG 
GCCTCTG-3'). The oligos were designed to amplify the ORF with a consensus Kozak 
sequence at the 5 f end as determined from GenBank #X57234. PCR was carried out using 
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the following conditions: 96°C, 45 sec; 63 °C, 45 sec; 72°C, 2 min. for 30 cycles. The 
resulting PCR product was purified, subcloned into pCRII-TOPO (Invitrogen Corp.), 
sequence verified, and digested with BamHI/XIioI. This insert was subcloned into the vector 
at the BamHL-XJiol sites. 
5 Full length Wnt5a: A murine Wnt5a cDNA clone was purchased from Upstate 

Biotechnology (Lake Placid, NY) and subcloned into the EcoRl site of the vector. 
Sequencing confirmed insert orientation. 

Full length human Dkk-1 : A human cDNA with GenBank accession number 
AF127563 was available in the public database. Using this sequence, PCR primers were 
10 designed to amplify the open reading frame with a consensus Kozak sequence immediately 
upstream of the initiating ATG. Oligos 1 17162 (SEQ ID NO: ) (5'- 

CAATAGTCGACGAATTCACCATGGCTCTGGGCGCAGCGG-3 1 ) and 117163 (SEQ ID 
NO: ) (5 f -GTATTGCGGCCGCTCTAGATTAGTGTCTCTGACAAGTGTGAA-3 f ) were 
used to screen a human uterus cDNA library by PCR. The resulting PCR product was 

15 purified, subcloned into pCRII-TOPO (Invitrogen Corp.), sequence verified, and digested 
with EcoRVXfiol. This insert was subcloned into the pCS2 + vector at the EcoRl-Xhol sites. 

Full length human Dkk-2: A full length cDNA encoding human Dkk-2 was isolated 
to investigate the specificity of the Zmax/LRP5/HBM interaction with the Dkk family of 
molecules. Dkk-1 was identified in yeast as a potential binding partner of 

20 Zmax/LRP5/HBM. Dkk-1 has also been shown in the literature to be an antagonist of the 
Wnt signaling pathway, while Dkk-2 is not (Krupnik et aL, 1999). The Dkk-2 full length 
cDNA serves as a tool to discriminate the specificity and biological significance of 
Zmax/LRP5/HBM interactions with the Dkk family {e.g., Dkk-1, Dkk-2, Dkk-3, Dkk-4, 
Soggy, their homologs and variant, etc.). A human cDNA sequence for Dkk-2 (GenBank 

25 Accession No. NMJH4421) was available in the public database. Using this sequence, PCR 
primers were designed to amplify the open reading frame with a consensus Kozak sequence 
immediately upstream of the initiating ATG. Oligos 51409 (SEQ ID NO: ) (5 f - 
CT AACGGATCC ACCATGGCCGCGTTGATGCGG-3 ') and 51411 (SEQ ID NO: ) (5'- 
GATTCGAATTCTC AAATTTTCTGAC AC AC ATGG-3 f ) were used to screen human 
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embryo and brain cDNA libraries by PGR. The resulting PCR product was purified, 
subcloned into pCRH-TOPO, sequence verified, and digested with BamHUEcoRL This insert 
was subcloned into the pCS2 + vector at the BaniHL-EcoRI sites. 

Full length LRP6 was isolated from the pED6dpc4 vector by Xliol-Xbal digestion. 
5 The full length cDNA was reassembled into the XJtoI-Xbal sites of pCS2 + . Insert orientation 
was confirmed by DNA sequencing. 

mRNA Synthesis and Microinjection Protocol. mRNA for microinjection into 
Xenopus embryos is generated by in vitro transcription using the cDNA constructs in the 
pCS2 + vector described above as template. RNA is synthesized using the Ambion mMessage 

10 mMachine high yield capped RNA transcription kit (Cat. #1 340) following the 

manufacturer's specifications for the Sp6 polymerase reactions. RNA products were brought 
up to a final volume of 50 \x\ in sterile, glass-distilled water and purified over Quick Spin 
Columns for Radiolabeled RNA Purification G50-Sephadex (Roche, Cat. #1274015) 
following the manufacturer's specifications. The resulting eluate was finally extracted with 

15 phenol:chloroform:isoamyl alcohol and isopropanol precipitated using standard protocols 

(Sambrook et aL, 1989). Final RNA volumes were approximately 50 pi. RNA concentration - 
was determined by absorbance values at 260 nm and 280 nm. RNA integrity was visualized 
by ethidium bromide staining of denaturing (formaldehyde) agarose gel electrophoresis 
(Sambrook et aL, 1989). Various amounts of RNA (2 pg to 1 ng) are injected into the ventral 

20 blastomere of the 4- or 8-cell Xenopus embryo. These protocols are described in Moon et aL, 
1989 Techmque-J. of Methods in Cell & MoL Biol. 1: 76-89; and Peng, 1991 Meth. Cell. 
Biol. 36: 657-62. 

Screening for Duplicated Body Axis. In vitro transcribed RNA is purified and 
injected into a ventral blasomere of the 4- or 8-cell Xenopus embryo (approx. 2 hours post- 
25 fertilization). At stage 10.5 (approx. 11 hours post-fertilization), the injected embryos are 
cultured for a total of 72 hours and then screened for the presence of a duplicated body axis 
(dorsalization) (Fig. 33). Using XWnt8-injected (2-10 pg) as a positive control (Christian et 
aL, 1991) and water-injected or non-injected embryos as negative controls, we replicated the 
published observation that Zmax(LRP5) + Wnt5a (500 and 20 pg, respectively) could induce 
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axis duplication. Wnt5a (20 pg) alone could not induce axis duplication (as previously 
reported by Moon et al y 1993). We have also injected GFP RNA (100-770 pg) as a negative 
control to show that the amount of RNA injected is not perturbing embryo development (not 
shown). Strikingly, HBM + Wnt5a (500 and 20 pg, respectively) yielded an approximately 
5 3.5 fold more robust response of the phenotype (p—0.043 by Fisher's exact test) compared to 
Zmax(LRP5) + Wnt5a, suggesting that the HBM mutation is activating the Wnt pathway 
(Figs. 34 and 35). The HBM/Wht5a embryos also appear to be more "anteriorized" than the 
Zmax(LRP5)/Wnt5 a embryos, again suggestive of a gain-of-function mutation. 

The role of Dkk-1 as a modulator of Zmax/LRP5- and HBM-mediated Wnt signaling 

10 was investigated. Literature reports have previously characterized Xenopus and murine Dkk- 
1 as antagonists of the canonical Wnt pathway in the Xenopus system (Glinka et aL 9 1998 
Nature 391: 357-62). Using the human Dkk-1 construct, a dose-response assay was 
performed to confirm that our construct was functional and to identify the optimal amount of 
RNA for microinjection. Using 250 pg/embryo of hDkk-1 RNA, over 90% (pO.OOl) of the 

15 embryos were observed to display enlarged anterior structures (big heads) as anticipated from 
the published reports (Fig. 36). 

The mechanism of hDkk-1 modulation of Wnt signaling in the presence of 
Zmax/LRP5 or HBM was also investigated. Without any hDkk-1 present, it was confirmed 
that HBM + Wnt5a was a more potent activator of Wnt signaling than Zmax/LRP5 + Wnt5a 

20 (p<6.05). Interestingly, in the presence of hDkk-1 (250 pg), Zmax/LRP5-mediated Wnt 
signaling was repressed (p<0.05) but hDkk-1 was unable to repress HBM-mediated Wnt 
signaling (pO.Ol) (Fig. 37). The specificity of this observation can be further addressed by 
investigating other members of the Dkk family, other Wnt genes, LRP6, additional 
Zmax/LRP5 mutants, and the peptide aptamers. 



25 



Example 6 

Effects of exogenous Dkk and L>RP5 on Wnt signaling in the TCF-lnciferase Assay 
Wnt activity can be antagonized by many proteins including secreted Frizzled related 
proteins (SFRPs), Cerberus, Wnt Inhibitory Factor- 1 and Dkk-1 (Krupnik et aL, 1999). The 
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Dkk family of proteins consists of Dkk- 1-4 and Soggy, a Dkk-3-like protein. Dkk-1 and 
Dkk-4 have been shown to antagonize Wnt mediated Xenopus embryo development, whereas 
Dkk-2, Dkk-3, and Soggy do not. Unlike many of these proteins that antagonize Wnt activity 
by directly interacting with Wnt proteins, Dkk-1 acts by binding to two recently identified 
5 Wnt coreceptors, LRP5 and LRP6 (Mao et al , 200 1 ; Bafico et al , 200 1 ). The details of this 
interaction have been examined by the present inventors and Mao et al. using deletion 
constructs of LRP6, which demonstrated that EGF repeats 3 and 4 are important for Dkk-1 
interaction. Accordingly, the activity of two Dkk proteins, Dkk-1 and Dkk-2, were 
investigated with various Wnt members, LRP5, LRP6, and the mutant form of LRP5, 

10 designated HBM. The present invention explores whether there is any functional difference 
between LRP5 and HBM with regard to Dkk action on Wnt mediated signaling. Various 
reagents were developed, including Dkk-1 peptides, constrained LRP5 peptide aptamers, 
constrained Dkk-1 peptide aptamers and polyclonal antibodies to Dkk-1 (in Example 4 
above) to identify factors that mimic HBM mediated Wnt signaling. 

15 Methods. Various LRP5 constrained peptides were developed. Specifically, four 

peptides that interact with the LBD of LRP5 (Fig. 38,constructs OST259-262 in Fig. 39) and 
three peptides that interact with the cytoplasmic domain of LRP5 (constructs OST266- 
OST268 in Fig. 39). In addition two Dkk-1 peptides were developed: constructs OST264 
and OST265 in Fig. 39, corresponding to Dkk-1 amino acids 139-266 and 96-245, containing 

20 the smallest region of Dkk-1 that interacts with LRP5 (Fig. 40). The cDNA clones encoding 
the LRP5 LBD interacting peptides and the Dkk-1 peptides were subcloned into pcDNA3.1 
with the addition of a Kozak and signal sequence to target the peptide for secretion. The 
constructs encoding the three peptides interacting with the cytoplasmic domain of LRP5 were 
also subcloned into pcDNA3.1. However, these latter constructs do not contain a signal 

25 sequence. 

HOB-03-CE6 osteoblastic cells developed by Wyeth Ayerst (Philadelphia, PA) were 
seeded into 24-well plates at 150,000 cells per well in 1 ml of the growth media (D- 
MEM/F12 phenol red-free) containing 10% (v/v) heat-inactivated FBS, IX penicillin 
streptomycin, and IX Glutamax-1, and incubated overnight at 34°C. The following day, the 
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cells were transfected using Lipofectamine 2000® (as described by the manufacturer, 
Invitrogen) in OptiMEM (Invitrogen) with 0.35 ix% /well of LRP5, HBM, or control plasmid 
DNA (empty vector pcDNA3. 1) and either Wntl or Wnt3a plasmid DNA. Similar 
experiments were performed with LRP6 plasmid DNA (0.35jUg/well) or a control pEDdpc4 
5 empty vector. Furthermore, each of these groups were then divided into three groups, those 
receiving 0.35 jug/well Dkk-1, Dkk-2, or pcDNA3.1 control DNA. All wells were 
transfected with 0.025 /^g/well of CMV beta-galactosidase plasmid DNA and 0.35 /ig/well 
16X TCF(AS>luciferase reporter DNA (developed by Ramesh Bhat, Wyeth-Ayerst 
(Philadelphia, PA)). After 4 hours of incubation, the cells were rinsed and 1 ml of fresh 

10 growth media was added to each well. The cells were cultured overnight at 34 °C, followed 
by a wash and a change of media. Cells were cultured for an additional 18-24 hours at 37 °C. 
Cells were then lysed with 50 /zl/well of IX lysis buffer. The extracts were assayed for beta- 
galactosidase activity (Galacto Reaction Buffer Diluent & Light Emission Accelerator, 
Tropix) using 5 /A extract + 50 pel beta-galactosidase diluent and luciferase activity 

15 (Luciferase Assay Reagent, Promega) using 20 {A extract. 

U20S human osteosarcoma cells were also utilized. U20S cells (ATCC) were 
seeded into 96-well plates at 30,000 cells per well in 200 jxl of the growth media (McCoy's 
5A) containing 10% (v/v) heat-inactivated FBS, IX penicillin streptomycin, and IX 
Glutamax-1, and incubated overnight at 37°C. The following day, the media was replaced 

20 with OptiMEM (Invitrogen) and cells were transfected using Lipofectamine 2000® (as 
described by the manufacturer, Invitrogen) with 0.005 fig/well of LRP5, HBM, LRP6 or 
contol plasmid DNA (empty vector pcDNA3.1) and either Wntl (0.0025 jig/well) or Wnt3a 
(0.0025 ^ig/well) plasmid DNA. In addition, the 16x-(AS) TCF-TK-firefly-luciferase and 
control TK-renilla luciferase (Promega Corp.) were co-transfected at 0.3 jig/well and 0.06 

25 (ig/well respectively in all experiments. Furthermore, each of these groups was then divided 
into different groups, those receiving 0.05 fig/well Dkk-1, Dkk-2, Dkk3, Dkkl -Alkaline 
Phosphatase (AP), mutant Dkk-1 (C220A), Soggy or pcDNA3.1 control DNA. In other 
experiments, cells were co-transfected with 0.005 ^ig/well of LRP5, 0.0025 |Lig/well of Wntl 
or Wnt3a (using 0.0025 ^ig/well of a control pcDNA3.1) with LRP5-interacting aptamers 
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(0.05 jig/well). Cells were cultured for an additional 18-20 hours at 37 °C. Culture medium 
was removed. Cells were cultured for an additional 18-20 hours at 37 °C. Culture medium 
was removed. Cells were then lysed with 100 yul/well of IX Passive Lysis Buffer (PLB) of 
Dual Luciferase Reagent kit (DLR-kit-Promega Corp.) 20 jul of the lysates were combined 
5 with LARII reagent of DLR-kit and assayed for TCF-firefly luciferase signal in Top Count 
(Packard) instrument. After measuring the Firefly readings, 100 fA of the "Stop and Glo" 
reagent of DLR kit that contains a quencher and a substrate for renilla luciferase was added 
into each well. Immediately the renilla luciferase reading was measured using the Top Count 
(Packard) Instrument. The ratios of the TCF-firefly luciferase to control renilla readings 
10 were calculated for each well and the mean ratio of triplicate or more wells was expressed in 
all data. 

Results. The results of these experiments demonstrate that Dkk-1, in the presence of 
Wntl and LRP5, significantly antagonized TCF-luciferase activity (Fig. 41). In marked 
contrast, Dkk-1 had no effect on HBM/Wntl mediated TCF-luciferase activity (Fig. 41). In 

15 similar experiments, Dkk-1 was also able to antagonize LRP5/Wnt3a but not HBM/Wnt3a 
mediated TCF-luciferase activity (Fig. 42). These results indicate that the HBM mutation 
renders Dkk-1 inactive as an antagonist of Wntl and Wnt3a signaling in HOB03CE6 
osteoblastic cells. In other experiments with Wntl, Dkk-1 had no effect on LRP5 or HBM 
mediated TCF-luciferase activity (Fig. 41). In contrast, with either LRP5 or HBM in the 

20 presence of Wnt3a, Dkk-2 was able to antagonize the TCF-luciferase activity (Fig. 42). 

These latter results indicate that the HBM mutation has no effect on Dkk-2 action in the 
presence of Wnt3a. Experiments were also performed using the closely related LRP6 cDNA 
in HOB-03-CE6 cells. In these experiments, LRP6/Wntl and LRP6/Wnt3a mediated TCF- 
luciferase were regulated in the same manner as LRP5. Specifically, Dkk-1 antagonized 

25 LRP6/Wntl mediated TCF-luciferase activity, whereas Dkk-2 had no effect (Fig. 41). 

However, similar to the action of Dkk-2 with LRP5/Wnt3a, Dkk-2 was able to antagonize 
LRP6/Wnt3a mediated TCF-luciferase activity (Fig. 42). 

The results in the U20S cells show a robust effect of the OST262 LRP5 peptide 
aptamer activation of Wnt signaling in the presence of Wnt3a (Fig. 43). These functional 
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results are confirmed by the results shown below in Example 7 using LRP5 peptide aptamers 
in the Xenopus assay. Such results affirmatively demonstrate that the effects of small 
molecules on LRP5/LRP6/HBM signaling can be detected using the TCF-luciferase assay. 

These data demonstrate that there is a functional difference between LRP5 and HBM 
5 regarding the ability of Dkk-1 to antagonize Wntl and Wnt3a signaling. These data and 

previous data showing that Dkk-1 directly interacts with LRP5 suggests that the inability of 
Dkk-1 to antagonize HBM/Wnt signaling may in part contribute to the HBM phenotype. 
These experiments further demonstrate the ability to test various molecules (e.g., small 
molecules, aptamers, peptides, antibodies, LRP5 interacting proteins or Dkk-1 interacting 
10 proteins, and the like) for a LRP5 ligand that mimics HBM mediated Wnt signaling or factors 
that block Dkk-1 interaction with LRP5. 

This was the assay that was used to show the responsiveness of two HBM-like 
variants. See Figs. 29 and 30. The data also demonstrates that these variants are less 
susceptible to modulation by Dkk. 

15 

Example 7 

Cell-Based Functional High-Throughput Assay 
To develop a high throughput assay, the TCF-luciferase assay described in Example 6 
was modified utilizing low level expression of endogenous LRP5/6 in U20S and HEK293 

20 cells. However, HOB-03-CE6 cells and any other cells which show a differential response to 
Dkk depending on whether LRP5, LRP6 or HBM are expressed. Using U20S (human 
osteosarcoma) and HEK293 (ATCC) cells, the TCF-luciferase and tk-Renilla reporter 
element constructs were co-transfected along with Wnt3a/1 and Dkk. Wnt3a alone, by using 
endogenous LRP5/6, was able to stimulate TCF reporter gene activation. When Dkk, is co- 

25 transfected with Wnt3a/Wnt 1 and reporters (TCF-luci and tk-Renilla), Dkk represses 

reporter element activity. In addition, the TCF-luci signal is activated by Wnt3a/Wntl can be 
repressed by the addition of Dkk-enriched conditioned media to the cells containing 
Wnt3a/Wntl and reporters. The assay is further validated by the lack of TCF-reporter 
inhibition by a point mutant construct (C220A) of Dkkl. 
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The Dkk-mediated repression of the reporter is dependent upon the concentration of 
transfected Dkk cDNA or on the amount of Dkk-conditioned media added. In addition, the 
Dkk-mediated reporter suppression can be altered by the co-transfection of LRP5, LRP6, and 
HBM cDNAs in the U20S or HEK293 cells. In general, U20S cells show greater sensitivity 
5 to Dkk-mediated reporter suppression than that in HEK-293 cells. In U20S cells, the 

transfection of LRP5/LRP6/HBM/HBM-like cDNA leads to moderate activation of TCF-luci 
in the absence of Wnt3a/Wntl transfection. This activation presumably utilizes the 
endogenous Wnts present in U20S cells. Under this condition, Dkkl can repress TCF-luci 
and shows a differential signal between LRP5 and HBM. By co-transfecting Wnt3a/Wntl, 

10 there is a generalized increase in the TCF-luci signal in the assay. Further, one can detect 
Dkk-mediated differential repression of the reporter due to LRP5 and HBM cDNA 
expression as well as between LRP5 and LRP6 cDNA. The repression is maximal with 
LRP6, moderate with LRP5, and least with HBM cDNA expression. In addition, the assay 
can detect the functional impact of the LRP5 interacting peptide aptamers (Fig. 38), Dkkl 

15 interacting aptamers and binding domains of Dkk-1 (Fig. 40; OST264 and OST265 of Figs. 
39 and 44). 

Using this system with a suppressed Wnt-TCF signal due to the presence of both Dkk 
and Wnt3a, one can screen for compounds that could alter Dkk modulation of Wnt signaling, 
by looking for compounds that activate or the TCF-luciferase reporter, and thereby relieve 

20 the Dkk-mediated repression of the Wnt pathway. Such compounds identified may 

potentially serve as HBM-mimetics and be useful, for example, as osteogenic therapeutics. 
Data generated from this high throughput screen are demonstrated in Figs. 45-47. Fig. 45 
shows that Dkkl represses Wnt3a-mediated signaling in U20S bone cells. Fig. 46 
demonstrates the functional differences between LRP5, LRP6, and HBM. Dkk-1 represses 

25 LRP6 and LRP5 but has little or no effect on HBM-generated Wntl signaling in U20S cells. 
Fig. 47 demonstrates the differential effects of various Dkk family members and modified 
Dkks, including Dkk-1, a mutated Dkk-1 (C220A), Dkk-l-AP (modified with alkaline 
phosphatase), Dkk-3, and Soggy. 
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Example 8 

DKK/LRP5/6/HBM/HBM-like ELISLA Assay 
A further method to investigate Dkk binding to LRP or HBM and HBM like 
polypeptides is via ELISA assay. Two possible permutations of this assay are exemplified. 
5 LRP5 is immobilized to a solid surface, such as a tissue culture plate well. One skilled in the 
art will recognize that other supports such as a nylon or nitrocellulose membrane, a silicon 
chip, a glass slide, beads, etc. can be utilized. In this example, the form of LRP5 used is 
actually a fusion protein where the extracellular domain of LRP5 is fused to the Fc portion of 
human IgG. The LRP5-Fc fusion protein is produced in CHO cell extracts from stable cell 
10 lines. The LRP5-Fc fusion protein is immobilized on the solid surface via anti-human Fc 

antibody or by Protein-A or Protein G-coated plates, for example. The plate is then washed 
to remove any non-bound protein. Conditioned media containing secreted Dkk protein or 
secreted Dkk-epitope tagged protein (or purified Dkk or purified Dkk-epitope tagged protein) 
is incubated in the wells and binding of Dkk to LRP is investigated using antibodies to either 
15 Dkk or to an epitope tag. Dkk-V5 epitope tagged protein would be detected using an alkaline 
phosphatase tagged anti-V5 antibody. 

Alternatively, the Dkk protein could be directly fused to a detection marker, such as 
alkaline phosphatase. Here the detection of the Dkk-LRP interaction can be directly 
investigated without subsequent antibody-based experiments. The bound Dkk is detected in 
20 an alkaline phosphatase assay. If the Dkk-alkaline phosphatase fusion protein is bound to the 
immobilized LRP5, alkaline phosphatase activity would be detected in a colorimetric 
readout. As a result, one can assay the ability of small molecule compounds to alter the 
binding of Dkk to LRP using this system. Compounds, when added with Dkk (or epitope- 
tagged Dkk) to each well of the plate, can be scored for their ability to modulate the 
25 interaction between Dkk and LRP based on the signal intensity of bound Dkk present in the 
well after a suitable incubation time and washing. The assay can be calibrated by doing cold 
competition experiments with unlabeled Dkk or with a second type of epitope-tagged Dkk. 
Any small molecule that is able to modulate the Dkk-LRP interaction may be a suitable 
therapeutic candidate, more preferably an osteogenic therapeutic candidate. 
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Example 9 

Functional Evaluation of Peptide Aptamers in Xenopus 
The constrained peptide aptamers constructs OST258-263 (where 258 contains the 
signal sequence by itself and 263 contains an irrelevant constrained peptide) (Figs. 39 and 
5 44) were used to generate RNA substantially as described in Example 6, except the vector 

was linearized by restriction endonuclease digestion and RNA was generated using T7 RNA 
polymerase. 

Aptamer RNA was injected at 250 pg per blastomere using the protocol of Example 
6. Wnt signaling was activated, as visualized by embryo dorsalization (duplicated body axis) 

10 with aptamers 261 and, more strongly, 262. The results of this assay are shown in Figs. 48 
and 49. These results suggest that aptamers 261 and 262 are able to activate Wnt signaling 
possibly by binding to the LBD of LRP, thereby preventing the modulation of LRP-mediated 
signaling by Dkk. < 
The aptamers of the present invention can serve as HBM-mimetics. In the Xenopus 

15 system they are able to induce Wnt signaling all by themselves. They may also serve as tools 
for rational drug design by enhancing the understanding of how peptides are able to interact 
with LRP and modulate Wnt signaling at the specific amino acid level. Thus, one would be 
able to design small molecules to mimic their effects as therapeutics. In addition, the 
aptamers identified as positives in this assay may be used as therapeutic molecules 

20 themselves. 

Example 10 
Homogenous Assay 

An excellent method to investigate perturbations in protein-protein interactions is via 
25 Fluorescence Resonance Energy Transfer (FRET). FRET is a quantum mechanical process 
where a fluorescent molecule, the donor, transfers energy to an acceptor chromophore 
molecule which is in close proximity. This system has been successfully used in the 
literature to characterize the intermolecular interactions between LRP5 and Axin (Mao et aL, 
Molec. CellBioL 7: 801-9). There are many different fluorescent tags available for such 
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studies and there are several ways to fluorescently tag the proteins of interest. For example, 
CFP (cyan fluorescent protein) and YFP (yellow fluorescent protein) can be used as donor 
and acceptor, respecively. Fusion proteins, with a donor and an acceptor, can be engineered, 
expressed, and purified. 

5 For instance, purified LRP protein, or portions or domains thereof, fused to CFP and 

purified Dkk protein, or portions or domains thereof that interact with Dkk or LRP 
respectively, fused to YFP can be generated and purified using standard approaches. If LRP- 
CFP and Dkk- YFP are in close proximity, the transfer of energy from CFP to YFP will result 
in a reduction of CFP emission and an increase in YFP emission. Energy is supplied with an 

10 excitation wavelength of 450 nm and the energy transfer is recorded at emission wavelengths 
of 480 nm and 570 nm. The ratio of YFP emission to CFP emission provides a guage for 
changes in the interaction between LRP and Dkk. This system is amenable for screening 
small molecule compounds that may alter the Dkk-LRP protein-protein interaction. 
Compounds that disrupt the interaction would be identified by a decrease in the ratio of YFP 

15 emission to CFP emission. Such compounds that modulate the LRP-Dkk interaction would 
then be considered candidate HBM mimetic molecules. Further characterization of the 
compounds can be done using the TCF-luciferase or Xenopus embryo assays to elucidate the 
effects of the compounds on Wnt signaling. 

While the above example describes a cell-fee, solution-phase assay using purified 

20 components, a similar cell-based assay could also be performed. For example, LRP-CFP 

fusion protein can be expressed in cells. The Dkk- YFP fusion protein then could be added to 
the cells either as purified protein or as conditioned media. The interaction of LRP and Dkk 
is then monitored as described above. 

25 Example 11 

Identification of Variants of LRP5/Zmax1/HB M 

Because the YWTD repeats constitute the major part of domain which interacts with 
Dkk-1, a search was begun from the repeats to look for protein folds that contain such 
repeats. Springer, 1998 J. Mol Biol 283: 837-62 proposed that these YWTD repeats, which 
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had been previously described as "spacers" and considered to have no defined structure, are 
in fact the propeller-blade subdomains of highly structured six-bladed p-propellers. Springer 
(1998) therefore proposed two theoretically modeled protein structures with such YWTD 
repeats. 

5 The Model. A set of LRP5 propeller domain sequences from mouse and human were 

assembled and aligned using CLUSTALW, together with the sequences corresponding to two 
of the modeled YWTD-propeller structures of Springer (1998) [llpx, based on the LRP1 p- 
propeller domain no. 7 from chicken (SwissProt LRP1_CH7), and lndx, based on human 
nidogen (Swiss-Prot NIDO _HU1)]. The alignment was manually edited based on the 

10 rationale and knowledge of protein structures. Fig. 50 depicts a schematic model of LRP5. 

The secondary structures consist exclusively of P-strands and turns. The secondary structure 
assignments were verified by the predictive program, DSC (Discrimination of Protein 
Secondary Structure Class available at http://bioweb.pasteur.fr/seqanal/interfaces/dsc.htail). 
Preliminary checks of the exon-intron boundaries were performed manually. A tertiary- 

15 structure homology model for human LRP5 using the llpx model as a template was built 
with Insightn (Accelrys Inc., San Diego, CA) and examined using the graphics display 
programs from Insight II and Rasmol (freely available at 
http://www.umass.edxi/microbio/rasmol/index2.html). 

Results. Based on the homology alignments and modeling, the domain diagram of 

20 LRP5 in Fig. 50 was obtained. Fig. 51 shows the complete alignment and secondary 

structure assignments for the propeller domains of mouse LRP5 and LRP6, human LRP5 and 
human LRP6, and the sequences corresponding to the two theoretical models constructed by 
Springer (1998). 

To a good first approximation, the sequence alignments support the proposal that 
25 there are four 6-bladed P-propeller domains in human LRP5. This model nicely 

accommodates the sequence (as should be expected since it derives from a carefully 
constructed model of a chicken LRP propeller domain). The overall shape of this domain 
resembles a disk with inward-sloping sides and a hole down the middle. The polypeptide 
chain enters and leaves from the same (comparatively flat) "bottom" surface. As indicated in 
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the alignment (Fig. 51) and in several of the structure illustrations, the G171 V mutation falls 
into a loop on the outer or 'top" face of the domain. This immediately suggests that - 
contrary to what one might expect, given the role of highly conserved glycine residues in 
determining protein folds - the mutation should not have any significant effect on the 
5 domain's tertiary structural stability. It is rather more likely that the mutation interferes with 
binding of some ligand, possibly a macromolecule (Smith et ah, 1999 Trends Biochem. Sci. 
24: 181-5) or alters a protein-protein interaction. 

Closer scrutiny of the protein in the vicinity of G171 shows that this residue sits at the 
bottom of a pocket on the outer edge of the top surface of the domain. Moreover, one side of 

10 this pocket consists of a cluster of very hydrophobic side chains, while the other side has 
more polar groups, most notably glutamate 172. In the mutant form of the protein, with 
VI 71 in place of the glycine, the pocket largely disappears as the isopropyl group of valine 
replaces the hydrogen side chain of glycine. It is easy to understand how such a substitution 
could seriously disrupt ligand binding if the ligand normally protrudes into the pocket. By 

15 the same rationale, any other mutations that block such pockets in any of the four propellers 
could also result in impaired ligand binding. 

The reasonableness of the proposed model is based on Springer's results as well as 
additional data. For example, data derived from the typical features of such protein domains 
(including the well-studied YWTD repeats), e.g., that are robust and rigid tertiary structures. 

20 Loop conformations may vary somewhat, especially for the longer loops, but the basic 

features of the protein scaffold are almost certainly well predicted. Of course we would be 
on somewhat more secure ground if there were a crystal structure for one of these domains. 
That naturally implies that some of the more interesting features, such as the identity of 
residues exposed on the outer, putative ligand-binding surface, will also be well-predicted. 

25 Note that in Fig. 5 1 these residues are marked in red on the sequences corresponding to 

Springer's (1998) models. Even though different models could possibly be built that are not 
dependent on Springer's assumptions and lead to a different topology of the propeller, this 
would not alter the conclusion that the G171 V mutation lies in a surface loop. 

At the very least, the model affords an opportunity to think about this protein in 
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molecular terms and that should facilitate both experimental design and evaluation of results, 
i.e., candidate epitopes might be selected more rationally. One interesting aspect, evoked 
forcefully by the diagram in Fig. 50, is the question of how the different modular propeller 
domains interact. The EGF-like domains are well-studied (dozens of crystal and NMR 
5 structures exist, see for example Bork et al, 1996 Quart Rev. Biophys. 29: 1 19-67). The 

EGF-like domains consist mainly of a couple of p-hairpins, cross-linked by disulfide bonds, 
and in some cases they have tightly bound calcium ions, which must further stabilize their 
structure. Interestingly, in contrast to the YWTD propellers, the EGF-like domains have the 
polypeptide chain entering at one end and exiting at the other. Thus, mechanically, these 
10 domains could act as spacers, even swivels (since rotation can occur around the tails that 
extend from either end). Given the size of the propellers, there are almost certainly inter- 
domain interactions between them as they assemble in some higher-order structure, 
connected by the EGF-like domains. Because of the topology of the propellers, their "top" 
faces would have to face outward since the EGF-like domains will tie them together by their 
15 protruding N- and C-terminal extensions. 

A more sophisticated analysis of the (i-propeller structure was modeled using X-ray 
ctystallographic data from the LDL receptor. The primary amino acid sequences of the 
various beta propeller domains of LRP5 (and other domains) were used to develop homology 
models of their 3-dimensional structures according to the following method: 
20 (A) Search for suitable structural templates and check sequence identity 

with target. The ExNRL-3D database (derived from the Protein Data Bank 
(http://pdb.ccdc.cam.ac.uk/pdb/) was searched using BLASTP2 (Altschul et al, 1990 
J. Mol. Biol. 215: 403-10; Huang et al, 1991 Adv. Appl. Math. 12: 337-67; and 
Peitsch, 1995 PDB Quart. Newsletter 72: 4) to find all similarities of the target 
25 sequence with sequences of known structure. This program selects all templates with 

sequence identities above 25% and projected model size greater than 20 residues. 
This step also detects domains that can be modelled based on unrelated templates. 
Use of this step resulted in the selection of Protein Data Bank (PDB) structure, 1IJQ 
("Crystal Structure Of The LDL Receptor YWTD-EGF Domain Pair") (Jeon et al, 
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2001 Nat Struct. Biol 8: 499). 

(B) Create ProModH (Altschul et aU 1990; Huang et al, 1991; and 
Peitsch, 1995) jobs and generate models with ProModll. 

(C) Superimpose related 3D structures. This method is based upon the 
diagonals of sequence similarity (the Dynamic sequence alignment algorithm 
(SIM)(Altschul et al 9 1990; Huang et al, 1991; and Peitsch, 1995). Specifically, the 
following steps are performed: 

(i) Primary match: Regions with sequence similarity are selected automatically 
(manual selection is also possible) and the corresponding residues matched in three- 
space, (ii) The primary match is further refined using expanding context spheres. 

(D) Generate a multiple alignment comprising the sequences to be modeled. 

(E) Generate a framework for the new sequence. 

(F) Based on the topological arrangement of corresponding atoms: (a) atoms 
which occupy a similar portion of space and are expected to have a structural 
counterpart in the new structure are used to compute the framework coordinates 
(averaged positions); (b) Side chains with fully incorrect geometries are removed. 

(G) Rebuild the lacking loops based on the geometry of the loop stems: (a) the 
stems of the loops to rebuild are used to scan a database of structural fragments 
derived from the Brookhaven Data Bank. Either the best fitting fragment or a 
framework derived from the five best fragments, is used as the new loop, or (b) the 
conformational space is searched using a CSP approach (seven allowed angle 
combinations, space allocation for the loop, space allocation for each a-carbon in the 
loop). Both methods will place only a-carbons when loops are in steric conflict with 
the surrounding context. 

(H) Rebuilt incomplete backbones based on the position of the a-carbons, the 
backbone was rebuilt using a set of seven allowed angle combinations and a 
database of backbone fragments (a sliding window of five residues was run through 
the protein sequence. The best matching backbone fragment for each overlapping 
pentapeptide was then stored. A framework for the main chain atoms is derived from 
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15 



these peptides, using only the coordinates of the three central residues of each 
pentapeptide. 



(i) Verification of structural quality is based on the method described by 
R. Luthy et ah, 1992 Nature 356: 83-85. This method analyzes the 3D context 
of each residue and allows the identification of mis-folded regions. 

(ii) Packing is checked based on a probe accessible surface (Connoly 
surface) computation and a cubic grid passed through the structure. Inside and 
outside surfaces are detected and the center and size of each cavity is 
computed. The algorithm then compares the size and distribution of these 
cavities between the model and the structure in order to detect possibly mis- 
folded regions. 

(J) Refine structure by energy minimization and molecular dynamics based on 
force field computations following Gromos96 {Methods for the evaluation of long- 
range electrostatic forces in computer simulations of molecular systems, IN 
Computer Simulation of Biomolecular Systems, Theoretical and 
Experimental Applications, Vol. 2, 182-212 (W.F. van Gunsteren et ah, eds., 
Escom Science Publishers, Leiden, The Netherlands, 1993); van Gunsteren et ah, 
1990 Angew. Chem. Int. Ed. Engl 29: 992-1023; van Gunsteren et ah, 1992 Eur. J. 
Biochem. 204: 947-961; Torda and W.F. van Gunsteren. Molecular Modeling Using 
Nuclear Magnetic Resonance Data, IN REVIEWS IN COMPUTATIONAL CHEMISTRY, 
Vol. m, 143-172 (K. B. Lipkowitz et al, eds., VCH Publishers, Inc. New York, 
1992); van Gunsteren, Molecular dynamics and stochastic dynamics simulation: A 
primer, in COMPUTER SIMULATION OF BIOMOLECULAR SYSTEMS, THEORETICAL AND 

Experimental Applications 3-36; van Gunsteren et ah, Computation of free energy 
in practice: choice of approximations and accuracy limiting factors. IN COMPUTER 
Simulation of Biomolecular Systems, Theoretical and Experimental 
Applications 3 15-348; Smith et ah, Methods for the evaluation of long-range 
electrostatic forces in computer simulations of molecular systems, IN COMPUTER 



00 



The model's structural quality is verified and packing is checked as follows: 
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Simulation of Biomolecular Systems, Theoretical and Experimental 
Applications 1 82-2 12; van Gunsteren et ah , Accounting for Molecular Mobility in 
Structure Determination Based on Nuclear Magnetic Resonance Spectroscopic and 
X-Ray Diffi-action Data, in Methods in Enzymology: Nuclear Magnetic 
5 Resonance, Vol. 239 619-654 (T.L. James et ah, eds., Academic Press, New York, 

1994); van Gunsteren et ah, 199 >4 Quart Rev. Biophysics 27: 435-481; van 
Gunsteren et ah, 1995 Biomolecular Modelling: Ovei*view of Types of Methods to 
Search and Sample Conformational Space, IN PROCEEDINGS OF THE 1ST EUROPEAN 
Conference on Computational Chemistry, American Institute of Physics 
10 CONF. Proc. 330: 253-268; and van Gunsteren et ah, 1995 Computer Phys. 

Communications 91: 305-319). 

(K)The structure is then manually refined based on sequence alignment and structural 
elements using Deep View, the SWISS-PDB Viewer (Guex et ah, 1997 
Electrophoresis 18: 2714-23). 

15 

Uses of structural model. Having a three-dimensional model of the protein domain 
allows one to appreciate the context of the HBM mutation, or other HBM-like mutations, in 
three dimensional space. This facilitates the prediction of possible mechanisms of action in a 
manner impossible in any other way because one cannot predict from primary sequence alone 

20 the proximity of one distant (in sequence space) amino acid to another. This also allows one 
to make functional predictions that can then be tested using molecular/cellular biology 
techniques to further refine the model and validate the target. For example, the structural 
model of the G171 V mutation predicted an alteration in side chain-side chain interactions, so 
we were able to use this information to successfully predict similar mutations in other amino 

25 acids that would have analogous structural effects. Overlaying other functional data on this 
model would also able highlight other potentially important regions of the protein domain. 
Additionally, such models show the accessible residues, which would be useful for 
developing small compounds, antibodies and the like which recognize and bind to this 
region. 
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Results. Based on the space filling hypothesis set forth in Section 3 above, two 
residues from blade 6 of propeller 1 were identified as being in structurally equivalent 
locations to residue 171 in blade 4. Accordingly, substitutions with valine were engineered: 
F241Vand A242V. 

5 Residues that were predicted to be accessible to the interior surface of the propeller 

were identified in both blades 3 and 5 of Zmaxl (LRP5). Original predictions from the 
Springer model identified G199 on blade 5 and E128 on blade 3 as the equivalent position to 
residue 171 of blade 4 (HBM). Using the more sophisticated model described above, G199 
and E128 are not predicted to be in the equivalent position to residue 171 of blade 4. Based 
10 on the new, more sophisticated model, the following residues were chosen for valine 
substitutions: 

Blade No. Mutations 

5 L200V, T201 V and I202V 

3 S127V 

15 

Another residue T125 was selected for a conservative substitution with T125S as well as a 
less conservative substitution of T125G. 

The role of the YWTD repetitive motifs in the (3-propeller was examined by carrying 
out an alanine replacement scan over this region in Blade 4. These repeats are predicted to 
20 create the p-sheet blades of the propeller. Based on this model data, the following additional 
mutations were prepared in propeller 1 : Y164A, M165A, Y166A, W167A, T168A, and 
D169A. These substitutions would be predicted to be detrimental to the P-propeller structure. 

The spacing filling or occupied space model was further examined by mutating a 
residue predicted by the above methods to reside on the exterior surface of propellar 1, blade 
25 4. This mutation, K215V (K-V at position 215), was selected because it was believed to not 
produce an HBM-like effect. 

Based on this information, the following model of the HBM G171V mutation was 
obtained. Specifically, substitutions of valine at specific locations within propeller 1 could 
result in the creation of hydrophobic patches on the surface of the propeller. For example, 
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the V at residue 171 of the HBM mutant comes in close proximity to LI 50 on the adjacent 
strand, potentially creating a hydrophobic patch that was not present with the wild type 
G171. An example of this prediction, and others are presented in a separate file (Fig. 52). 

Based on this model, a mutation of L150G which would remove the hydrophobic side 
5 chain at this site may not produce an HBM effect. In addition, a double mutant of G171 V 
(HBM) and L150G may not produce an HBM effect due to the elimination of the 
hydrophobic side chain interactions. The model can also be used to test this prediction on a 
different propeller 1 blade. The substitution A214V was shown to result in an HBM effect. 
1194 is the residue that is in the equivalent position to A214 on blade 5 as L150 is to G171 on 

10 blade 4. While the mutation of I194G alone may not be predicted to result an HBM effect, 
the double mutant of I194G and A214V may no longer permit a hydrophobic patch to form 
and as a result not generate the HBM effect observed with A214V. Additional analyses of 
the residues in proximity to A214 that may contribute to the formation of hydrophobic 
patches identified P240 and F241. Mutations of either P240G or F241G together with 

15 A214V may no longer permit the formation of hydrophobic patches and may eliminate the 
HBM effect observed with A214V alone. 

Lastly, the new model was used to investigate specific aspects of the other three 
propellers. The relevance of residues R494, R570 and V667 in the three other propellars was 
investigated. These three residues were chosen since they have been reported to be the site of 

20 osteoporosis pseudoglioma disease causing missense mutations: R494Q, R570W and V667M 
(Gong et al y 2001 Cell 107: 513-523). Based on the structural location of these residues on 
the outer surface of the propeller, the model would not predict that these substitutions are 
either gain- or loss-of-function substitutions (Fig. 53A-C). For example, the mutations 
related to osteoporosis-pseudoglioma syndrome (OPPG) result in patients having very low 

25 bone mass and who are prone to developing fractures and deformation. This genetic 

condition is known now to result from mutations in LRP5 (Gong et aL, 2001 Cell 107: 513- 
23). Based on this information, this mutation would be categorized as a loss in function 
mutation. 

As a result, we have engineered these receptor variant to functionally test. These new 
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receptor variants were generated exactly as described above with the following 
oligonucleotides: 

L150GF GAAGGTGCTCTTCTGGCAGGACGGTGACCAGCCGAGGGCC 
LI 50GR GGCCCTCGGCTGGTCACCGTCCTGCCAGAAGAGCACCTTC 

I194GF GATCATTGTGGACTCGGACGGTTACTGGCCCAATGGACTG 
I194GR CAGTCCATTGGGCCAGTAACCGTCCGAGTCCACAATGATC 



10 



L200VF CATTTACTGGCCCAATGGAGTGACCATCGACCTGGAGGAGCAGAAGC 
L200VR GCTTCTGCTCCTCCAGGTCGATGGTCACTCCATTGGGCCAGTAAATG 



15 



T201VF CATTTACTGGCCCAATGGACTGGTCATCGACCTGGAGGAGCAGAAGC 

T201VR GCTTCTGCTCCTCCAGGTCGATGACCAGTCCATTGGGCCAGTAAATG 

I202VF CATTTACTGGCCCAATGGACTGACCGTCGACCTGGAGGAGCAGAAGC 

I202VR GCITCTGCTCCTCCAGGTCGACGGTCAGTCCATTGGGCCAGTAAATG 



20 



T125SF GGCAAGAAGCTGTACTGGTCGGACTCAGAGACCAACCGCATC 

T125SR GATGCGGTTGGTCTCTGAGTCCGACCAGTACAGCTTCTTGCC 

T125GF GGCAAGAAGCTGTACTGGGGGGACTCAGAGACCAACCGCATC 

T125GR GATGCGGTTGGTCTCTGAGTCCCCCCAGTACAGCTTCTTGCC 



25 



S127VF GAAGCTGTACTGGACGGACGTAGAGACCAACCGCATCGAGGTG 
S127VR CACCTCGATGCGGTTGGTCTCTACGTCCGTCCAGTACAGCTTC 



30 



F24 1 VF CCTGACGCACCCCGTCGCCCTGACGCTCTCCGGGGACACTC 

F241VR GAGTGTCCCCGGAGAGCGTCAGGGCGACGGGGTGCGTCAGG 

A242VF CCTGACGCACCCCTTCGTCCTGACGCTCTCCGGGGACACTC 

A242VR GAGTGTCCCCGGAGAGCGTCAGGACGAAGGGGTGCGTCAGG 



35 



K2 1 5 VF CTACTGGGCTGACGCCGTGCrCAGCTTCATCCACCGTGCC 

K215VR GGCACGGTGGATGAAGCTGAGCACGGCGTCAGCCCAGTAG 

Yl 64AF CTTGGACCCCGCTCACGGGGCCATGTACTGGACAGACTGG 

Y164AR CCAGTCTGTCCAGTACATGGCCCCGTGAGCGGGGTCCAAG 



40 



Ml 65AF GGACCCCGCTCACGGGTACGCGTACTGGACAGACTGGGGTG 
Ml 65 AR CACCCCAGTCTGTCCAGTACGCGTACCCGTGAGCGGGGTCC 



45 



Yl 66AF CCCGCTCACGGGTACATGGCCTGGACAGACTGGGGTGAGAC 

Y166AR GTCTCACCCCAGTCTGTCCAGGCCATGTACCCGTGAGCGGG 

Wl 67AF CGCTCACGGGTACATGTACGCGACAGACTGGGGTGAGACGC 

W 1 67 AR GCGTCTCACCCCAGTCTGTCGCGTACATGTACCCGTGAGCG 
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T168AF 
T168AR 



GCTCACGGGTACATGTACTGGGCAGACTGGGGTGAGACGCC 
GGCGTCTCACCCCAGTCTGCCCAGTACATGTACCCGTGAGC 



D169AF 
D169AR 



CACGGGTACATGTACTGGACAGCCTGGGGTGAGACGCCCCG 
CGGGGCGTCTCACCCCAGGCTGTCCAGTACATGTACCCGTG 



R494QF 
R490QR 



CAACTTGGATGGGCAGGAGCAGCGTGTGCTGGTCAATGCCTC 
GAGGCATTGACCAGCACACGCTGCTCCTGCCCATCCAAGTTG 



R570WF 
R570WR 



GCAGCGCCGCAGCATCGAGTGGGTGCACAAGGTCAAGGCCAG 
CTGGCCTTGACCTTGTGCACCCACTCGATGCTGCGGCGCTGC 



V667MF 
V667MR 



CGAGACCAATAACAACGACATGGCCATCCCGCTCACGGGCG 
CGCCCGTGAGCGGGATGGCCATGTCGTTGTTATTGGTCTCG 



Point mutations in other molecules with beta-propellers have been described in the 
literature. For example, alpha4-integrin (Guerrero-Esteo et aL, 1998 FEBS Letter 429: 123- 
8) has had two mutations introduced, which resulted in altered protein function (i.e., G130R 
and G190S). Both of these glycines are in structurally equivalent locations to G171 of LRP5, 
just located on different blades of the beta-propeller at the upper surface. Using the model 
discussed above, we mapped these residues of the alpha4-integrin and confirmed their 
placement. The result of these substitutions in the alpha4 integrin was loss of ligand binding 
and reduced affinity to its heterodimer binding partner: integrin-betal, such that the alpha4- 
betal (a4pl) heterodimer cannot be formed. Examples of modulation of other residues in 
integrin-alpha4 with functional consequences are referenced within Guerrero-Esteo et ah 
(1998). Thus, these data support the notion that disruption of beta-propeller structure can 
result in significant and dramatic effects on receptor function. 

All references cited are herein incorporated by reference in their entirety for all 
purposes. The following applications are also incorporated by reference in their entirety 
herein: U.S. Application Nos. 09/543,771 and 09/544,398 filed on April 5, 2000, which are a 
continuation-in-part of Application No. 09/229,319, filed January 13, 1999, which claims 
benefit of U.S. Provisional Application No. 60/071,449, filed January 13, 1998, and U.S. 
Provisional Application No. 60/105,51 1, filed October 23, 1998. Additionally This 
application claims priority of Application Nos. 60/290,071 filed May 11, 2001; 60/291,311 
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filed May 17, 2001; 60/353,058 filed February 1, 2002, and 60/361,293 filed March 4, 2002 
the texts of which are herein incorporated by reference in their entirety for all purposes. 
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CLAT M S 

What is claimed is: 

1 . A nucleic acid comprising a mutation in LRP5 or LRP6 which results in a 
HBM-like phenotype when expressed in a cell, wherein said HBM-like phenotype results in 
hone mass modulation and/or lipid level modulation. 

2. A LRP5 nucleic acid comprising at least one mutation of Tables 2 or 3 or 
which results in a mutation of one of the following G171V, A214V, A65V, M282V, G171K, 
G171F, G171I, G171Q, L200V, T201V, I202V, or S127V when expressed in a cell, and 
wherein expression of the nucleic acid in a subject results in bone mass modulation and/or 
lipid level modulation. 

3 . A LRP6 nucleic acid comprising at least one of Tables 2 or 3 or which results 
in amutation of one of the following G171V, A214V, A65V, M282V, G171K, G171F, 
G171I, G171Q, L200V, T201V, I202V, or S127V when expressed in a cell, and wherein said 
mutation occurs in LRP6 in a position equivalent to LRP5 such that expression of the nucleic 
acid in a subject results in bone mass modulation and/or lipid level modulation. 

4. A LRP5 nucleic acid encoding at least one amino acid mutation in propeller 1 . 

^ 

5. The LRP5 nucleic acid of claim 4, wherein said nucleic acid encodes for at 
least one mutation selected from the group consisting of: G171V, A214V, A65V, M282V, 
G171K, G171F, G171I, G171Q, L200V, T201V, I202V, and S127V. 

6. A LRP6 nucleic acid encoding at least one amino acid mutation in propeller 1. 

7. The LRP6 nucleic acid of claim 6, wherein said nucleic acid encodes for at 
least one mutation in an equivalent position of LRP6 selected from the group of LRP5 
mutations consisting of: G171V, A214V, A65V, M282V, G171K, G171F, G171I, G171Q, 
L200V, T201V, I202V, and S127V. 

8. The nucleic acid of claims 1-7, wherein the mutation is G171 V, A214V, 
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A65V, M282V, G171K, G171F, G171I or G171Q in LRP5 or in an equivalent domain in 
LRP6. 

9. A polypeptide encoded by a nucleic acid of claims 1 to 8, wherein said 
polypeptide when expressed in a cell modulates Wnt signaling, Dkk activity, LRP5 activity 
and/or LRP6 activity. 

10. The polypeptide of claim 9, wherein said polypeptide modulates bone mass 
and/or lipid levels when expressed in a subject. 

11. A vector comprising a nucleic acid of claims 1 to 8. 

12. A cell comprising the vector of claim 1 1 . 

13. The cell of claim 12, wherein said cell is a cancer cell, a liver cell or a bone 

cell. 

14. A polypeptide or biologically active fragment thereof derived from LRP5 
comprising at least one amino acid change of Table 2, G171V, A214V, A65V, M282V, 
G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V. 

15. The polypeptide or biologically active fragment of claim 14, wherein said 
protein or biologically active fragment comprises at least one amino acid change of G171 V, 
A214V, A65V, M282V, G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V. 

16. A polypeptide or biologically active fragment thereof derived from LRP6 
comprising at least one amino acid change of Table 2, G171V, A214V, A65V, M282V, 
G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V when the amino acid 
change is expressed in an equivalent position in LRP6. 

17. The polypeptide or biologically active fragment of claim 16, wherein said 
protein or biologically active fragment comprises at least one amino acid change of G171 V, 
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A214V, A65V, M282V, G171K, G171F, G171I, G171Q, L200V, T201V, I202V, or S127V. 

18. A polypeptide or biologically active fragment which when expressed in a 
subject has a HBM-like phenotype comprising at least one amino acid mutation in propeller 1 
with the proviso that if the mutation is GV171 that a second mutation which maintains the 
HBM-like phenotype is also present. 

1 9. The polypeptide or biologically active fragment of claim 1 8, wherein said 
polypeptide is derived from LRP5, LRP6 or HBM. 

20. The polypeptide or biologically active fragment of claims 14-19, wherein the 
amino acid change is G171V, A214V, A65V, M282V, G171K, G171F, G171I or G171Q of 
LRP5 or an equivalent position in LRP6. 

21. An antibody or immunogenic fragment thereof which binds to a polypeptide 
or antigenic fragment thereof of claims 14 to 20. 

22. The antibody or immunogenic fragment thereof of claim 21, wherein the 
antibody is a monoclonal antibody, a chimeric antibody, a bispecific antibody, a humanized 
antibody, a primatized® antibody, a human antibody, or a labeled antibody. 

23 . An antibody which binds to a polypeptide comprising 
208 KLYWADAKLSFIHRAN 223 , 277 ALYSPMDIQVLSQER 291 , 61 GLEDAAAVDFQFSKGA 73 , 
^EGSLTHPFALTLSG 247 , 249 TLYWTDWQTRSIHACN 264 , 144 VLFWQDLDQPRAI 156 , 
194 IYWPNGLTIDLEEQKLY 210 , 34 LLLFANKRDVRLVD 47 , 

75 GAVYWTDVSEEAIKQ 89 , 121 KLYWTDSETNRIEVA 135 of LRP5 or an equivalent domain 
on LRP6 or variants thereof. 

24. An antibody which binds to a polypeptide comprising 
969 LILPLHGLRNVKAIDYDPLDKFIYW 993 , 989 KFIYWVDGRQNIKRAKD^ 
1009 QPFVLTSLSQGQNPDRQPHDLSIDI 1033 , 1029 LSroiYSRTLFWTCEATOT , lNVHRL 1053 , 
1049 NVHRLSGEAMGWLRGDRDKPRAIV 1073 , 1253 CGEPPTCSPDQFAC 1266 , 
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1278 WRCDGFPECDDQSDEEGC 1295 , 13l6 RCDGEADCQDRSDEADC 1332 , 
l370 CEITKPPSDDSPAH 1383 of LRP5 or an equivalent domain on LRP6 or variants thereof, 
wherein said antibody modulates binding of LRP5, HBM, LRP6 or variants thereof to Dkk. 

25. A method of diagnosing a HBM-like phenotype in a subject comprising 

(A) obtaining a biological sample from the subject; 

(B) exposing the sample to an antibody or an immunogenic fragment of claims 
21-24; and 

(C) detecting whether the antibody bound a protein from the biological sample 
from the subject to determine whether the subject has a HBM-like phenotype. 

26. A composition for modulating bone mass and/or lipid levels in a subject 
comprising a therapeutically effective amount of an antibody of claims 21-24 and a 
pharmaceutically acceptable carrier. 

27. An antibody or an immunogenic fragment thereof of claims 21-24, wherein 
the antibody or immunogenic fragment thereof can (1) discriminate between LRP5 and an 
HBM-like protein, (2) discriminate between LRP6 and an HBM-like protein, or (3) 
discriminate between HBM and an HBM-like protein. 

28. A transgenic animal having somatic and/or germ cells comprising a nucleic 
acid which comprises a promoter region that directs protein expression in animal and/or 
human cells operably linked to a nucleic acid of claims 1 to 8, and wherein said transgenic 
animal has at least three bone parameters modulated by the expression of said nucleic acid. 

29. The transgenic animal of claim 28, wherein the promoter region is selected 
from the group consisting of CMV, RSV, SV40, and EF-la, CMVpbActin, histone, type I 
collagen, TGFpl, SX2, cfos/cjun, Cbfal, Fra/Jun, Dlx5, osteocalcin, osteopontin, bone 
sialoprotein, and collagenase promoter regions. 

30. A transgenic animal having somatic and/or germ cells comprising a nucleic 
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acid which comprises a sequence which encodes a polypeptide of claims 9 or 10, and 
wherein the nucleic acid further comprises an operably linked promoter region that directs 
protein expression in animal and/or human cells, and wherein the transgenic animal has at 
least three bone parameters modulated by the expression of said nucleic acid. 

31. An animal embryo comprising a nucleic acid comprising a promoter region 
that directs protein expression in animal and/or human cells operably linked to a nucleic 
acid of claims 1 to 8. 

32. An animal or human cell transfected with a nucleic acid which comprises a 
promoter region that directs protein expression in animal and/or human cells operably 
linked to a nucleic acid of claims 1 to 8. 

33 . The transgenic animal of claim 28, wherein a human HBM-like protein is 
expressed and passes the human HBM-like gene to its offspring. 

34. A transgenic animal produced from the transgenic animal of claim 33 or its 
offspring. 

35. An animal model for the study of bone density modulation and/or lipid level 
modulation comprising a first group of animals composed of the transgenic animal of claim 
28 and a second group of control animals. 

36. The animal model of claim 35, wherein the group of transgenic animals have 
cells which comprise a nucleic acid encoding human an HBM-like protein or biologically 
active polypeptide fragment thereof. 

37. An animal model for the study of bone density modulation comprising a first 
group of animals composed of the transgenic animals of claim 30 and a second group of 
control animals. 

38. A method of identifying agents which modulate the activity of an HBM-like 
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nucleic acid comprising: 

(a) transfecting a cell with a vector of claim 1 1 ; 

(b) exposing the transfected cell of step (a) to a compound; and 

(c) determining whether the compound modulates the activity of the HBM-like 
nucleic acid. 

39. A method of identifying agents which modulate the activity of an HBM-like 
protein comprising: 

(a) transfecting a cell with a vector of claim 1 1 ; 

(b) exposing the transfected cell of step (a) to a compound; and 

(c) determining whether the compound modulates the activity of the HBM-like 



40. The method of claim 39, wherein the compound is a hormone, a growth factor, 
a peptide, RNA, shRNA, siRNA, DNA, a mineral, a vitamin, a natural product, or a synthetic 
organic compound. 

41. A method for identifying compounds which modulate the interaction of Dkk 
with the Wnt signaling pathway comprising: 

(a) transfecting cells with constructs containing a nucleic acid of claims 1 to 
claim 8; 

(b) assessing changes in expression of a reporter element linked to a Wnt- 
responsive promoter; and 

(c) identifying as a Dkk/Wnt interaction modulating compound any compound 
which alters reporter gene expression compared with cells transfected with a Dkk construct 
alone. 

42. The method according to Claim 41, wherein the cells are cancer cells, liver 
cells or bone cells. 

43. The method according to Claim 42, wherein the cells are U2-OS, HOB-03- 
CE6,orHEK293 cells. 



protein. 
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44. The method according to Claim 41, wherein the reporter element used is TCF- 
luciferase, tk-Renilla, or a combination thereof. 

45. A method of diagnosing a subject as expressing a nucleic acid comprising a 
nucleotide change of Tables 2 or 3, the method comprising the steps of: 

(A) obtaining a biological sample from the subject; and 

(B) assaying for the presence of the nucleotide change which results in HBM 



46. A method for identifying agents which modulate LRP5, LRP6, or HBM 
comprising the steps of: 

(A) providing cells according to claim 32; 

(B) exposing the cells to a test compound; and 

(C) measuring the expression of LRP5, LRP6 or HBM respectively. 

47. The method for identifying agents of claim 42, further comprising a step of 
determining whether the agent further modulates bone mass and/or lipid levels. 

48. An agent identified by the method of claims 46 or 47. 



phenotype. 
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Exon 1 

ACTAAAGCGCCGCCGCCGCGCCATGGAGCCCGAGTGAGCGCGGCGCG 
GGCCCGTCCGGCCGCCGGACAACATfiGAGGCAGCGCCGCCCGGGCCG 
CCGTGGCCGCTGCTGCTGCTGCTGCTGCTGCTGCTGGCGCTGTGCGGC 
TGCCCGGCCCCCGCCGCGGCC 

Exon 2 Coordinates: 527dl2_Contig308G 30944-30549 

gccccacagCCTCGCCGCTCCTGCTATTTGCCAACCGCCGGGACGTACGGC 

TGGTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCATCGTGGTCAGC 

GGCCTGGAGGATGCGGCCGCAGTGGACTTCCAGTTTTCCAAGGGAGC 

CGTGTACTGGACAGACGTGAGCGAGGAGGCCATCAAGCAGACCTACCT 

GAACCAGACGGGGGCCGCCGTGCAGAACGTGGTCATCTCCGGCCTGG 

TCTCTCCCGACGGCCTCGCCTGCGACTGGGTGGGCAAGAAGCTGTACT 

GGACGGACTCAGAGACCAACCGCATCGAGGTGGCCAACCTCAATGGC 

ACATCCCGGAAGGTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCC 

ATCGCCTTGGACCCCGCTCACGGgtaaaccctgctg 

... 9408 nt ... 

Exon 3 Coordinates: 527dl2_Contig308G 21141-20945 

ccccgtcacagGTACATGTACTGGACAGACTGGGGTGAGACGCCCCGGATTG 

AGCGGGCAGGGATGGATGGCAGCACCCGGAAGATCATTGTGGACTCG 

GACATTTACTGGCCCAATGGACTGACCATCGACCTGGAGGAGCAGAAG 

CTCTACTGGGCTGACGCCAAGCTCAGCTTCATCCACCGTGCCAACCTG 

GACGGCTCGTTCCGgtaggtacccac 

... 6094 nt ... 

Exon 4 Coordinates: 527dl2_Contig308G 15047-14850 

tccctgactgcagGCAGAAGGTGGTGGAGGGCAGCCTGACGCACCCCTTCGCC 

CTGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGACCCGC 

TCCATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGAT 

CCTGAGTGCCCTATACTCACCCATGGACATCCAGGTGCTGAGCCAGGA 

GCGGCAGCCTTTCTgtgagtgccgg 

... 1827 nt... 

Exon 5 Coordinates: 527dl2_Contig308G 13220-13088 

tttctcagTCCACACTCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTG 

TGCCTGCTGTCCCCAAGCGAGCCTTTCTACACATGCGCCTGCCCCACG 

GGTGTGCAGCTGCAGGACAACGGCAGGACGTGTAAGGCAGgtgaggcggtgg 

gacg 



FIG. 3A 
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... 20923 nt ... 

Exon 6 Coordinates: 527dl2_Contig309G 7705-8100 

ctccacagGAGCCGAGGAGGTGCTGCTGCTGGCCCGGCGGACGGACCTAC 

GGAGGATCTCGCTGGACACGCCGGACTTCACCGACATCGTGCTGCAGG 

TGGACGACATCCGGCACGCCATTGCCATCGACTACGACCCGCTAGAGG 

GCTATGTCTACTGGACAGATGACGAGGTGCGGGCCATCCGCAGGGCG 

TACCTGGACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAA 

CGACCCCGATGGCATCGCGGTCGACTGGGTGGCCCGAAACCTCTACTG 

GACCGACACGGGCACGGACCGCATCGAGGTGACGCGCCTCAACGGCA 

CCTCCCGCAAGATCCTGGTGTCGGAGGACCTGGACGAGCCCCGAGCC 

ATCGCACTGCACCCCGTGATGGGgtaagacgggc 

3211 nt 

Exon 7 Coordinates: 527dl2_Contig309G 11311-11482 

ttcttctccagCCTCATGTACTGGACAGACTGGGGAGAGAACCCTAAAATCGA 
GTGTGCCAACTTGGATGGGCAGGAGCGGCGTGTGCTGGTCAATGCCTC 
CCTCGGGTGGCCCAACGGCCTGGCCCTGGACCTGCAGGAGGGGAAGC 
TCTACTGGGGAGACGCCAAGACAGACAAGATCGAGgtgaggctcctgtgg 

13445 nt 

Exon 8 Coordinates: 527dl2_Contig309G 24927-25143 

ccgtcctgcagGTGATCAATGTTGATGGGACGAAGAGGCGGACCCTCCTGGA 

GGACAAGCTCCCGCACATTTTCGGGTTCACGCTGCTGGGGGACTTCAT 

CTACTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGG 

TCAAGGCCAGCCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGG 

GGCTCAAAGCTGTGAATGTGGCCAAGGTCGTCGgtgagtccggggggtc 

....2826 nt 

Exon 9 Coordinates: 527dl2_Contig309G 27969-28256 

gttcgcttccagGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCA 

CCTGTGCTTCTTCACACCCCACGCAACCCGGTGTGGCTGCCCCATCGG 

CCTGGAGCTGCTGAGTGACATGAAGACCTGCATCGTGCCTGAGGCCTT 

CTTGGTCTTCACCAGCAGAGCCGCCATCCACAGGATCTCCCTCGAGAC 

CAATAACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGCCTC 

AGCCCTGGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGT 

CAGCCTGAAGgtagcgtgggc 

3102 

FIG. 3B 
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Exon 10 Coordinates: 527dl2_Contig309G 31358-31582 

cctgctgccagACCATCAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCA 

CGTGGTGGAGTTTGGCCTTGACTACCCCGAGGGCATGGCCGTTGACTG 

GATGGGCAAGAACCTCTACTGGGCCGACACTGGGACCAACAGAATCGA 

AGTGGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTGTGGAGGG 

ACTTGGACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGgtaagtgtt 

tgcctgtc 



Exon 11 Coordinates: 527dl2_Contig309G 32879-33064 

gtgccttccagCTACATCTACTGGACCGAGTGGGGCGGCAAGCCGAGGATCG 
TGCGGGCCTTCATGGACGGGACCAACTGCATGACGCTGGTGGACAAG 
GTGGGCCGGGCCAACGACCTCACCATTGACTACGCTGACCAGCGCCTC 
TACTGGACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTG 

Ggtgagggccgggct 

2069 nt 

Exon 12 Coordinates: 527dl2_Contig309G 35133-35454 

gtgttcatgcagGTCAGGAGCGGGTCGTGATTGCCGACGATCTCCCGCACCCG 

TTCGGTCTGACGCAGTACAGCGATTATATCTACTGGACAGACTGGAAT 

CTGCACAGCATTGAGCGGGCCGACAAGACTAGCGGCCGGAACCGCAC 

CCTCATCCAGGGCCACCTGGACTTCGTGATGGACATCCTGGTGTTCCA 

CTCCTCCCGCCAGGATGGCCTCAATGACTGTATGCACAACAACGGGCA 

GTGTGGGCAGCTGTGCCTTGCCATCCCCGGCGGCCACCGCTGCGGCT 

GCGCCTCACACTACACCCTGGACCCCAGCAGCCGCAACTGCAGCCgtaag 

tgcctcatggt 

2006 nt 

Exon 13 Coordinates: 527dl2_Contig309G 37460-37659 

gcctcctctaCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAG 

TCGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGCCCCT 

GCATGGACTGAGGAACGTCAAAGCCATCGACTATGACCCACTGGACAA 

GTTCATCTACTGGGTGGATGGGCGCCAGAACATCAAGCGAGCCAAGGA 

CGACGGGACCCAGgcaggtgccctgtgg 



1297 nt 



6965 nt 



FIG. 3C 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO. 



02092000 A2_l_> 



WO 02/092000 



PCT/US02/14877 



8/89 



Exon 14 Coordinates: 527dl2_Contig309G 44624-44832 

ctttgtcttacagCCCTTTGTTTTGACCTCTCTGAGCCAAGGCCAAAACCCAGAC 

AGGCAGCCCCACGACCTCAGCATCGACATCTACAGCCGGACACTGTTC 

TGGACGTGCGAGGCCACCAATACCATCAACGTCCACAGGCTGAGCGG 

GGAAGCCATGGGGGTGGTGCTGCGTGGGGACCGCGACAAGCCCAGGG 

CCATCGTCGTCAACGCGGAGCGAGGgtaggaggccaac 

1404 nt 

Exon 15 Coordinates: 527dl2_Contig309G 46236-46427 

ccaccctcccgcagGTACCTGTACTTCACCAACATGCAGGACCGGGCAGCCAA 

GATCGAACGCGCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCA 

CCACCGGCCTCATCCGCCCTGTGGCCCTGGTGGTGGACAACACACTGG 

GCAAGCTGTTCTGGGTGGACGCGGACCTGAAGCGCATTGAGAGCTGT 

GACCTGTCAGgtacgcgccccgg 

686 nt 

Exon 16 Coordinates: 527dl2_Contig309G 47113-47322 

ggctgcttgcagGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCA 

GCCTCTGGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCG 

CCAGCAGCAGATGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGC 

GGACTCGCATCCAGGGCCGTGTCGCCCACCTCACTGGCATCCATGCAG 

TGGAGGAAGTCAGCCTGGAGGAGTTCTgtacgtgggggc 

3884 nt 

Exon 17 Coordinates: 527dl2_Contig309G 51206-51331 

ttgtctttgcagCAGCCCACCCATGTGCCCGTGACAATGGTGGCTGCTCCCACA 
TCTGTATTGCCAAGGGTGATGGGACACCACGGTGCTCATGCCCAGTCC 
ACCTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGgtaggtgtgacctaggtgc 

....3905 nt 

Exon 18 Coordinates: 527dl2_Contig309G 55236-55472 

gttctcctctgtccctcccccagAGCCGCCCACCTGCTCCCCGGACCAGTTTGCATGT 

GCCACAGGGGAGATCGACTGTATCCCCGGGGCCTGGCGCTGTGACGG 

CTTTCCCGAGTGCGATGACCAGAGCGACGAGGAGGGCTGCCCCGTGT 

GCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGC 

GCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAG 

GTGGACTGTGACGgtgaggccctcc 

3052 nt 

FIG. 3D 
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Exon 19 Coordinates: 527dl2_Contig309G 58524-58634 

tctccttgcagCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGT 

GTGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACG 

GCTCCGACGAGCTCATGTGTGgtgagccagctt 



Exon 20 Coordinates: 527dl2_Contig309G 60082-60319 

gtttgtctctggcagAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACA 

GCAGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCAT 

GGGTGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGC 

GGGGGCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGC 

ACGTGCCCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCT 

TCACAGgtaaggagcctgagatatggaa 



....1095 nt 

Exon 21 Coordinates: 527dl2_Contig309G 61414-61552 

cttccctgccagGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCCTG 
ATGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCACGTCAC 
AGGGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCC 

GCCGgtgaggggcggg 



Exon 22 Coordinates: 527dl2_Contig309G 68065-68162 

ttggctctcctcagATCCTGAACCCGCCGCCCTCCCCGGCCACGGACCCCTCCC 
TGTACAACATGGACATGTTCTACTCTTCAAACATTCCGGCCACTGCGA 
GACCGTACAGgtaggacatcccctgcag 



1448 nt 



6513 nt 



.2273 nt. 



FIG. 3E 
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Exon 23 Coordinates: 527dl2_Contig309G 70435-70901 

tcaaacattccggccactgcgagaccgtacagGCCCTACATCATTCGAGGAATGGCGCCCC 

CGACGACGCCCTGCAGCACCGACGTGTGTGACAGCGACTACAGCGCC 

AGCCGCTGGAAGGCCAGCAAGTACTACCTGGATTTGAACTCGGACTCA 

GACCCCTATCCACCCCCACCCACGCCCCACAGCCAGTACCTGTCGGCG 

GAGGACAGCTGCCCGCCCTCGCCCGCCACCGAGAGGAGCTACTTCCAT 

CTCTTCCCGCCCCCTCCGTCCCCCTGCACGGACTCATC CTGACCTCGGC 

CGGGCC ACTCTGGCTTCTCTGTGCCCCTGT A A ATAGTTTTA A ATATGA AC A A 

AGAAAAAAATATATTTTATGATTTAA A A A AT A A ATATAATTGGGATTTTAA 

AAACATGAGAAATGTGAACTGTGATGGGGTGGGCAGGGCTGGGAGAACTT 

TGT AC AGTGG AGA A A T A TTT ATA A ACTTAATTTTGT AAAAC A 
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Mouse Zmaxl In situ hybridization 
1 00X Magnification 
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FIG. 10A 



Mouse Zmaxl In situ hybridization 
100X Magnification 
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Mouse Zmaxl In situ hybridization 
400X Magnification 
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FIG. 11A 



Mouse Zmaxl In situ hybridization 
400X Magnification 
Sense probe 
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Mouse Zmaxl In situ hybridization 
400X Magnification 
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FIG. 12A 



Mouse Zmaxl In situ hybridization 
400X Magnification 
Sense probe 
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FIG. 12B 
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Antisense Inhibition of Zmaxl Expression 
Antisense Oligos Controls Marker 
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FIG. 13 
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HBM: Constructs for transgenic mice 

Confirmation of expression 
Transient transfection into HOB-02-02 cells* 

CMV PActin ; , Type I collagen 

HBM Zmaxl HBM Zmaxl 

X 1,000 X 1,000 X 10 X10 

*Fold increase compared to Zmaxl 
in cells transfected with empty vector 

FIG. 17 



BNSDOCID: <WO 02092000A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 02/092000 



PCT/US02/14877 



o 1.5 

CD 

< 1.2 

8 0.9 



0T 

E 



x 

CO 



N 



0.6 
0.3 
0.0 



34/89 

Comparison of Human & Mouse TaqMan 
Primer/Probe Sets for Zmax1/HBM 



Human TaqMan Set 
Esa Mouse TaqMan Set - 
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Not 
Detected 
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Quantification of Human Zmaxl mRNA in a 




Human Zmaxl mRNA Measured (ng) 
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HBM: Transgenic mice 
mRNA expression by Taqman analysis* 

Tissue 

Line Tibia Femur Heart Gonad Brain Kidney Liver 

HBMMCBA 

2 7-10 20-90 2-30 6-11 5-9 <1 

13 1-2 6-7 3-4 5-6 <1 <1 

18 10-11 
HBMMTIC 

19 7-8 19-20 1 1 
35 1 1 0 0 

* relative to 2max1 in HOB-03-C5 

FIG. 20 
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HBM: In vivo pDXA - transgenic mice 
HBMMCBA construct 
SPINE 
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HBM: In vivo pDXA - transgenic mice 

HBMMCBA construct 
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FIG. 21 B 
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HBM: In vivo pDXA - transgenic mice 
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HBM: In vivo pDXA - transgenic mice 

HBMMTIC construct 
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HBM: In vivo pDXA - transgenic mice 

HBMMTIC construct 
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FIG. 21 F 
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HBM transgenic mice: in vivo pDXA* 
BMD % changes vs WT in 5 week old animals 



Line n Femur Spine Total Tibia mRNA 
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HBM transgenic mice: in vivo pDXA* 



BMD % changes vs WT in 9 week old animals 
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FIG. 24A 

HBMGI_2AS 

TCTAGACTCG AGGCGGCCGC CCATTGTGCA CTAAAGCGCC GCCGCCGCGC CATGGAGCCC 
GAGTGAGCGC GGCGCGGGCC CGTCCGGCCG CCGGACAACA TGGAGGCAGC GCCGCCCGGG 
CCGCCGTGGC CGCTGCTGCT GCTGCTGCTG CTGCTGCTGG CGCTGTGCGG CTGCCCGGCC 
CCCGCCGCGG CCTCGCCGCT CCTGCTATTT GCCAACCGCC GGGACGTACG GCTGGTGGAC 
GCCGGCGGAG TCAAGCTGGA GTCCACCATC GTGGTCAGCG GCCTGGAGGA TGCGGCCGCA 
GTGGACTTCC AGTTTTCCAA GGGAGCCGTG TACTGGACAG ACGTGAGCGA GGAGGCCATC 
AAGCAGACCT ACCTGAACCA GACGGGGGCC GCCGTGCAGA ACGTGGTCAT CTCCGGCCTG 
GTCTCTCCCG ACGGCCTCGC CTGCGACTGG GTGGGCAAGA AGCTGTACTG GACGGACTCA 
GAGACCAACC GCATCGAGGT GGCCAACCTC AATGGCACAT CCCGGAAGGT GCTCTTCTGG 
CAGGACCTTG ACCAGCCGAG GGCCATCGCC TTGGACCCCG CTCACGGGTA CATGTACTGG 
ACAGACTGGG TTGAGACGCC CCGGATTGAG CGGGCAGGGA TGGATGGCAG CACCCGGAAG 
ATCATTGTGG ACTCGGACAT TTACTGGCCC AATGGACTGA CCATCGACCT GGAGGAGCAG 
AAGCTCTACT GGGCTGACGC CAAGCTCAGC TTCATCCACC GTGCCAACCT GGACGGCTCG 
TTCCGGCAGA AGGTGGTGGA GGGCAGCCTG ACGCACCCCT TCGCCCTGAC GCTCTCCGGG 
GACACTCTGT ACTGGACAGA CTGGCAGACC CGCTCCATCC ATGCCTGCAA CAAGCGCACT 
GGGGGGAAGA GGAAGGAGAT CCTGAGTGCC CTCTACTCAC CCATGGACAT CCAGGTGCTG 
AGCCAGGAGC GGCAGCCTTT CTTCCACACT CGCTGTGAGG AGGACAATGG CGGCTGCTCC 
CACCTGTGCC TGCTGTCCCC AAGCGAGCCT TTCTACACAT GCGCCTGCCC CACGGGTGTG 
CAGCTGCAGG ACAACGGCAG GACGTGTAAG GCAGGAGCCG AGGAGGTGCT GCTGCTGGCC 
CGGCGGACGG ACCTACGGAG GATCTCGCTG GACACGCCGG ACTTCACCGA CATCGTGCTG 
CAGGTGGACG ACATCCGGCA CGCCATTGCC ATCGACTACG ACCCGCTAGA GGGCTATGTC 
TACTGGACAG ATGACGAGGT GCGGGCCATC CGCAGGGCGT ACCTGGACGG GTCTGGGGCG 
CAGACGCTGG TCAACACCGA GATCAACGAC CCCGATGGCA TCGCGGTCGA CTGGGTGGCC 
CGAAACCTCT ACTGGACCGA CACGGGCACG GACCGCATCG AGGTGACGCG CCTCAACGGC 
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FIG. 24B 

HBMGI_2AS 

ACCTCCCGCA AGATCCTGGT GTCGGAGGAC CTGGACGAGC CCCGAGCCAT CGCACTGCAC 
CCCGTGATGG GCCTCATGTA CTGGACAGAC TGGGGAGAGA ACCCTAAAAT CGAGTGTGCC 
AACTTGGATG GGCAGGAGCG GCGTGTGCTG GTCAATGCCT CCCTCGGGTG GCCCAACGGC 
CTGGCCCTGG ACCTGCAGGA GGGGAAGCTC TACTGGGGAG ACGCCAAGAC AGACAAGATC 
GAGGTGATCA ATGTTGATGG GACGAAGAGG CGGACCCTCC TGGAGGACAA GCTCCCGCAC 
ATTTTCGGGT TCACGCTGCT GGGGGACTTC ATCTACTGGA CTGACTGGCA GCGCCGCAGC 
ATCGAGCGGG TGCACAAGGT CAAGGCCAGC CGGGACGTCA TCATTGACCA GCTGCCCGAC 
CTGATGGGGC TCAAAGCTGT GAATGTGGCC AAGGTCGTCG GAACCAACCC GTGTGCGGAC 
AGGAACGGGG GGTGCAGCCA CCTGTGCTTC TTCACACCCC ACGCAACCCG GTGTGGCTGC 
CCCATCGGCC TGGAGCTGCT GAGTGACATG AAGACCTGCA TCGTGCCTGA GGCCTTCTTG 
GTCTTCACCA GCAGAGCCGC CATCCACAGG ATCTCCCTCG AGACCAATAA CAACGACGTG 
GCCATCCCGC TCACGGGCGT CAAGGAGGCC TCAGCCCTGG ACTTTGATGT GTCCAACAAC 
CACATCTACT GGACAGACGT CAGCCTGAAG ACCATCAGCC GCGCCTTCAT GAACGGGAGC 
TCGGTGGAGC ACGTGGTGGA GTTTGGCCTT GACTACCCCG AGGGCATGGC CGTTGACTGG 
ATGGGCAAGA ACCTCTACTG GGCCGACACT GGGACCAACA GAATCGAAGT GGCGCGGCTG 
GACGGGCAGT TCCGGCAAGT CCTCGTGTGG AGGGACTTGG ACAACCCGAG GTCGCTGGCC 
CTGGATCCCA CCAAGGGCTA CATCTACTGG ACCGAGTGGG GCGGCAAGCC GAGGATCGTG 
CGGGCCTTCA TGGACGGGAC CAACTGCATG ACGCTGGTGG ACAAGGTGGG CCGGGCCAAC 
GACCTCACCA TTGACTACGC TGACCAGCGC CTCTACTGGA CCGACCTGGA CACCAACATG 
ATCGAGTCGT CCAACATGCT GGGTCAGGAG CGGGTCGTGA TTGCCGACGA TCTCCCGCAC 
CCGTTCGGTC TGACGCAGTA CAGCGATTAT ATCTACTGGA CAGACTGGAA TCTGCACAGC 
ATTGAGCGGG CCGACAAGAC TAGCGGCCGG AACCGCACCC TCATCCAGGG CCACCTGGAC 
TTCGTGATGG ACATCCTGGT GTTCCACTCC TCCCGCCAGG ATGGCCTCAA TGACTGTATG 
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FIG. 24C 



HBMGI 2AS 



CACAACAACG GGCAGTGTGG GCAGCTGTGC CTTGCCATCC CCGGCGGCCA CCGCTGCGGC 
TGCGCCTCAC ACTACACCCT GGACCCCAGC AGCCGCAACT GCAGCCCGCC CACCACCTTC 
TTGCTGTTCA GCCAGAAATC TGCCATCAGT CGGATGATCC CGGACGACCA GCACAGCCCG 
GATCTCATCC TGCCCCTGCA TGGACTGAGG AACGTCAAAG CCATCGACTA TGACCCACTG 
GACAAGTTCA TCTACTGGGT GGATGGGCGC CAGAACATCA AGCGAGCCAA GGACGACGGG 
ACCCAGCCCT TTGTTTTGAC CTCTCTGAGC CAAGGCCAAA ACCCAGACAG GCAGCCCCAC 
GACCTCAGCA TCGACATCTA CAGCCGGACA CTGTTCTGGA CGTGCGAGGC CACCAATACC 
ATCAACGTCC ACAGGCTGAG CGGGGAAGCC ATGGGGGTGG TGCTGCGTGG GGACCGCGAC 
AAGCCCAGGG CCATCGTCGT CAACGCGGAG CGAGGGTACC TGTACTTCAC CAACATGCAG 
GACCGGGCAG CCAAGATCGA ACGCGCAGCG CTGGACGGCA CCGAGCGCGA GGTCCTCTTC 
ACCACCGGCC TCATCCGCCC TGTGGCCCTG GTGGTAGACA ACACACTGGG CAAGCTGTTC 
TGGGTGGACG CGGACCTGAA GCGCATTGAG AGCTGTGACC TGTCAGGGGC CAACCGCCTG 
ACCCTGGAGG ACGCCAACAT CGTGCAGCCT CTGGGCCTGA CCATCCTTGG CAAGCATCTC 
TACTGGATCG ACCGCCAGCA GCAGATGATC GAGCGTGTGG AGAAGACCAC CGGGGACAAG 
CGGACTCGCA TCCAGGGCCG TGTCGCCCAC CTCACTGGCA TCCATGCAGT GGAGGAAGTC 
AGCCTGGAGG AGTTCTCAGC CCACCCATGT GCCCGTGACA ATGGTGGCTG CTCCCACATC 
TGTATTGCCA AGGGTGATGG GACACCACGG TGCTCATGCC CAGTCCACCT CGTGCTCCTG 
CAGAACCTGC TGACCTGTGG AGAGCCGCCC ACCTGCTCCC CGGACCAGTT TGCATGTGCC 
ACAGGGGAGA TCGACTGTAT CCCCGGGGCC TGGCGCTGTG ACGGCTTTCC CGAGTGCGAT 
GACCAGAGCG ACGAGGAGGG CTGCCCCGTG TGCTCCGCCG CCCAGTTCCC CTGCGCGCGG 
GGTCAGTGTG TGGACCTGCG CCTGCGCTGC GACGGCGAGG CAGACTGTCA GGACCGCTCA 
GACGAGGCGG ACTGTGACGC CATCTGCCTG CCCAACCAGT TCCGGTGTGC GAGCGGCCAG 
TGTGTCCTCA TCAAACAGCA GTGCGACTCC TTCCCCGACT GTATCGACGG CTCCGACGAG 
CTCATGTGTG AAATCACCAA GCCGCCCTCA GACGACAGCC CGGCCCACAG CAGTGCCATC 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO. 



02092000A2_I_> 



WO 02/092000 



PCT/US02/14877 



43/89 



FIG. 24D 

HBMGI_2AS 

GGGCCCGTCA TTGGCATCAT CCTCTCTCTC TTCGTCATGG GTGGTGTCTA TTTTGTGTGC 
CAGCGCGTGG TGTGCCAGCG CTATGCGGGG GCCAACGGGC CCTTCCCGCA CGAGTATGTC 
AGCGGGACCC CGCACGTGCC CCTCAATTTC ATAGCCCCGG GCGGTTCCCA GCATGGCCCC 
TTCACAGGCA TCGCATGCGG AAAGTCCATG ATGAGCTCCG TGAGCCTGAT GGGGGGCCGG 
GGCGGGGTGC CCCTCTACGA CCGGAACCAC GTCACAGGGG CCTCGTCCAG CAGCTCGTCC 
AGCACGAAGG CCACGCTGTA CCCGCCGATC CTGAACCCGC CGCCCTCCCC GGCCACGGAC 
CCCTCCCTGT ACAACATGGA CATGTTCTAC TCTTCAAACA TTCCGGCCAC TGCGAGACCG 
TACAGGCCCT ACATCATTCG AGGAATGGCG CCCCCGACGA CGCCCTGCAG CACCGACGTG 
TGTGACAGCG ACTACAGCGC CAGCCGCTGG AAGGCCAGCA AGTACTACCT GGATTTGAAC 
TCGGACTCAG ACCCCTATCC ACCCCCACCC ACGCCCCACA GCCAGTACCT GTCGGCGGAG 
GACAGCTGCC CGCCCTCGCC CGCCACCGAG AGGAGCTACT TCCATCTCTT CCCGCCCCCT 
CCGTCCCCCT GCACGGACTC ATCCTGACCT CGGCCGGGCC ACTCTGGCTT CTCTGTGtCC 
CTGTAAATAG TTTTAAATAT GAACAAAGAA AAAAATATAT TTTATGATTT AAAAAATAAA 
TATAATTGGG ATTTTAAAAA CATGAGAAAT GTGAACTGTG ATGGGGTGGG CAGGGCTGGG 
AGAACTTTGT ACAGTGGAGA AATATTTATA AACTTAATTT TGTAAAACAG AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA GCGGCCGC 
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FIG. 25A 



ZMAXGI 3AS 



TCTAGACTCG AGGCGGCCGC CCATTGTGCA CTAAAGCGCC GCCGCCGCGC CATGGAGCCC 
GAGTGAGCGC GGCGCGGGCC CGTCCGGCCG CCGGACAACA TGGAGGCAGC GCCGCCCGGG 
CCGCCGTGGC CGCTGCTGCT GCTGCTGCTG CTGCTGCTGG CGCTGTGCGG CTGCCCGGCC 
CCCGCCGCGG CCTCGCCGCT CCTGCTATTT GCCAACCGCC GGGACGTACG GCTGGTGGAC 
GCCGGCGGAG TCAAGCTGGA GTCCACCATC GTGGTCAGCG GCCTGGAGGA TGCGGCCGCA 
GTGGACTTCC AGTTTTCCAA GGGAGCCGTG TACTGGACAG ACGTGAGCGA GGAGGCCATC 
AAGCAGACCT ACCTGAACCA GACGGGGGCC GCCGTGCAGA ACGTGGTCAT CTCCGGCCTG 
GTCTCTCCCG ACGGCCTCGC CTGCGACTGG GTGGGCAAGA AGCTGTACTG GACGGACTCA 
GAGACCAACC GCATCGAGGT GGCCAACCTC AATGGCACAT CCCGGAAGGT GCTCTTCTGG 
CAGGACCTTG ACCAGCCGAG GGCCATCGCC TTGGACCCCG CTCACGGGTA CATGTACTGG 
ACAGACTGGG GTGAGACGCC CCGGATTGAG CGGGCAGGGA TGGATGGCAG CACCCGGAAG 
ATCATTGTGG ACTCGGACAT TTACTGGCCC AATGGACTGA CCATCGACCT GGAGGAGCAG 
AAGCTCTACT GGGCTGACGC CAAGCTCAGC TTCATCCACC GTGCCAACCT GGACGGCTCG 
TTCCGGCAGA AGGTGGTGGA GGGCAGCCTG ACGCACCCCT TCGCCCTGAC GCTCTCCGGG 
GACACTCTGT ACTGGACAGA CTGGCAGACC CGCTCCATCC ATGCCTGCAA CAAGCGCACT 
GGGGGGAAGA GGAAGGAGAT CCTGAGTGCC CTCTACTCAC CCATGGACAT CCAGGTGCTG 
AGCCAGGAGC GGCAGCCTTT CTTCCACACT CGCTGTGAGG AGGACAATGG CGGCTGCTCC 
CACCTGTGCC TGCTGTCCCC AAGCGAGCCT TTCTACACAT GCGCCTGCCC CACGGGTGTG 
CAGCTGCAGG ACAACGGCAG GACGTGTAAG GCAGGAGCCG AGGAGGTGCT GCTGCTGGCC 
CGGCGGACGG ACCTACGGAG GATCTCGCTG GACACGCCGG ACTTCACCGA CATCGTGCTG 
CAGGTGGACG ACATCCGGCA CGCCATTGCC ATCGACTACG ACCCGCTAGA GGGCTATGTC 
TACTGGACAG ATGACGAGGT GCGGGCCATC CGCAGGGCGT ACCTGGACGG GTCTGGGGCG 
CAGACGCTGG TCAACACCGA GATCAACGAC CCCGATGGCA TCGCGGTCGA CTGGGTGGCC 
CGAAACCTCT ACTGGACCGA CACGGGCACG GACCGCATCG AGGTGACGCG CCTCAACGGC 
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FIG. 25B 

ZMAXGL3AS 

ACCTCCCGCA AGATCCTGGT GTCGGAGGAC CTGGACGAGC CCCGAGCCAT CGCACTGCAC 

CCCGTGATGG GCCTCATGTA CTGGACAGAC TGGGGAGAGA ACCCTAAAAT CGAGTGTGCC 

AACTTGGATG GGCAGGAGCG GCGTGTGCTG GTCAATGCCT CCCTCGGGTG GCCCAACGGC 

CTGGCCCTGG ACCTGCAGGA GGGGAAGCTC TACTGGGGAG ACGCCAAGAC AGACAAGATC 

GAGGTGATCA ATGTTGATGG GACGAAGAGG CGGACCCTCC TGGAGGACAA GCTCCCGCAC 

ATTTTCGGGT TCACGCTGCT GGGGGACTTC ATCTACTGGA CTGACTGGCA GCGCCGCAGC 

ATCGAGCGGG TGCACAAGGT CAAGGCCAGC CGGGACGTCA TCATTGACCA GCTGCCCGAC 

CTGATGGGGC TCAAAGCTGT GAATGTGGCC AAGGTCGTCG GAACCAACCC GTGTGCGGAC 

AGGAACGGGG GGTGCAGCCA CCTGTGCTTC TTCACACCCC ACGCAACCCG GTGTGGCTGC 

CCCATCGGCC TGGAGCTGCT GAGTGACATG AAGACCTGCA TCGTGCCTGA GGCCTTCTTG 

GTCTTCACCA GCAGAGCCGC CATCCACAGG ATCTCCCTCG AGACCAATAA CAACGACGTG 

GCCATCCCGC TCACGGGCGT CAAGGAGGCC TCAGCCCTGG ACTTTGATGT GTCCAACAAC 

CACATCTACT GGACAGACGT CAGCCTGAAG ACCATCAGCC GCGCCTTCAT GAACGGGAGC 

TCGGTGGAGC ACGTGGTGGA GTTTGGCCTT GACTACCCCG AGGGCATGGC CGTTGACTGG 

ATGGGCAAGA ACCTCTACTG GGCCGACACT GGGACCAACA GAATCGAAGT GGCGCGGCTG 

GACGGGCAGT TCCGGCAAGT CCTCGTGTGG AGGGACTTGG ACAACCCGAG GTCGCTGGCC 

CTGGATCCCA CCAAGGGCTA CATCTACTGG ACCGAGTGGG GCGGCAAGCC GAGGATCGTG 

CGGGCCTTCA TGGACGGGAC CAACTGCATG ACGCTGGTGG ACAAGGTGGG CCGGGCCAAC 

GACCTCACCA TTGACTACGC TGACCAGCGC CTCTACTGGA CCGACCTGGA CACCAACATG 

ATCGAGTCGT CCAACATGCT GGGTCAGGAG CGGGTCGTGA TTGCCGACGA TCTCCCGCAC 

CCGTTCGGTC TGACGCAGTA CAGCGATTAT ATCTACTGGA CAGACTGGAA TCTGCACAGC 

ATTGAGCGGG CCGACAAGAC TAGCGGCCGG AACCGCACCC TCATCCAGGG CCACCTGGAC 

TTCGTGATGG ACATCCTGGT GTTCCACTCC TCCCGCCAGG ATGGCCTCAA TGACTGTATG 
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FIG. 25C 



ZMAXGI 3AS 



CACAACAACG GGCAGTGTGG GCAGCTGTGC CTTGCCATCC CCGGCGGCCA CCGCTGCGGC 
TGCGCCTCAC ACTACACCCT GGACCCCAGC AGCCGCAACT GCAGCCCGCC CACCACCTTC 
TTGCTGTTCA GCCAGAAATC TGCCATCAGT CGGATGATCC CGGACGACCA GCACAGCCCG 
GATCTCATCC TGCCCCTGCA TGGACTGAGG AACGTCAAAG CCATCGACTA TGACCCACTG 
GACAAGTTCA TCTACTGGGT GGATGGGCGC CAGAACATCA AGCGAGCCAA GGACGACGGG 
ACCCAGCCCT TTGTTTTGAC CTCTCTGAGC CAAGGCCAAA ACCCAGACAG GCAGCCCCAC 
GACCTCAGCA TCGACATCTA CAGCCGGACA CTGTTCTGGA CGTGCGAGGC CACCAATACC 
ATCAACGTCC ACAGGCTGAG CGGGGAAGCC ATGGGGGTGG TGCTGCGTGG GGACCGCGAC 
AAGCCCAGGG CCATCGTCGT CAACGCGGAG CGAGGGTACC TGTACTTCAC CAACATGCAG 
GACCGGGCAG CCAAGATCGA ACGCGCAGCC CTGGACGGCA CCGAGCGCGA GGTCCTCTTC 
ACCACCGGCC TCATCCGCCC TGTGGCCCTG GTGGTAGACA ACACACTGGG CAAGCTGTTC 
TGGGTGGACG CGGACCTGAA GCGCATTGAG AGCTGTGACC TGTCAGGGGC CAACCGCCTG 
ACCCTGGAGG ACGCCAACAT CGTGCAGCCT CTGGGCCTGA CCATCCTTGG CAAGCATCTC 
TACTGGATCG ACCGCCAGCA GCAGATGATC GAGCGTGTGG AGAAGACCAC CGGGGACAAG 
CGGACTCGCA TCCAGGGCCG TGTCGCCCAC CTCACTGGCA TCCATGCAGT GGAGGAAGTC 
AGCCTGGAGG AGTTCTCAGC CCACCCATGT GCCCGTGACA ATGGTGGCTG CTCCCACATC 
TGTATTGCCA AGGGTGATGG GACACCACGG TGCTCATGCC CAGTCCACCT CGTGCTCCTG 
CAGAACCTGC TGACCTGTGG AGAGCCGCCC ACCTGCTCCC CGGACCAGTT TGCATGTGCC 
ACAGGGGAGA TCGACTGTAT CCCCGGGGCC TGGCGCTGTG ACGGCTTTCC CGAGTGCGAT 
GACCAGAGCG ACGAGGAGGG CTGCCCCGTG TGCTCCGCCGC CCAGTTCCCC TGCGCGCGG 
GGTCAGTGTGT GGACCTGCGC CTGCGCTGCG ACGGCGAGGC AGACTGTCAG GACCGCTCA 
GACGAGGCGGA CTGTGACGCC ATCTGCCTGC CCAACCAGTT CCGGTGTGCG AGCGGCCAG 
TGTGTCCTCAT CAAACAGCAG TGCGACTCCT TCCCCGACTG TATCGACGGC TCCGACGAG 
CTCATGTGTGA AATCACCAAG CCGCCCTCAG ACGACAGCCC GGCCCACAGC AGTGCCATC 
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FIG. 25D 



ZMAXGI 3AS 



GGGCCCGTCA TTGGCATCAT CCTCTCTCTC TTCGTCATGG GTGGTGTCTA TTTTGTGTGC 
CAGCGCGTGG TGTGCCAGCG CTATGCGGGG GCCAACGGGC CCTTCCCGCA CGAGTATGTC 
AGCGGGACCC CGCACGTGCC CCTCAATTTC ATAGCCCCGG GCGGTTCCCA GCATGGCCCC 
TTCACAGGCA TCGCATGCGG AAAGTCCATG ATGAGCTCCG TGAGCCTGAT GGGGGGCCGG 
GGCGGGGTGC CCCTCTACGA CCGGAACCAC GTCACAGGGG CCTCGTCCAG CAGCTCGTCC 
AGCACGAAGG CCACGCTGTA CCCGCCGATC CTGAACCCGC CGCCCTCCCC GGCCACGGAC 
CCCTCCCTGT ACAACATGGA CATGTTCTAC TCTTCAAACA TTCCGGCCAC TGCGAGACCG 
TACAGGCCCT ACATCATTCG AGGAATGGCG CCCCCGACGA CGCCCTGCAG CACCGACGTG 
TGTGACAGCG ACTACAGCGC CAGCCGCTGG AAGGCCAGCA AGTACTACCT GGATTTGAAC 
TCGGACTCAG ACCCCTATCC ACCCCCACCC ACGCCCCACA GCCAGTACCT GTCGGCGGAG 
GACAGCTGCC CGCCCTCGCC CGCCACCGAG AGGAGCTACT TCCATCTCTT CCCGCCCCCT 
CCGTCCCCCT GCACGGACTC ATCCTGACCT CGGCCGGGCC ACTCTGGCTT CTCTGTGCCC 
CTGTAAATAG TTTTAAATAT GAACAAAGAA AAAAATATAT TTTATGATTT AAAAAATAAA 
TATAATTGGG ATTTTAAAAA CATGAGAAAT GTGAACTGTG ATGGGGTGGG CAGGGCTGGG 
AGAACTTTGT ACAGTGGAGA AATATTTATA AACTTAATTT TGTAAAACAG AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA GCGGCCGC 
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FIG. 26A 

Alignment of Human and Mouse Zmaxl (LRP5) 
Amino-Acid Sequences. 
Top= Human, Bottom = Mouse 



1 MEA APPGPPWPLLLLLLLLLALCGCPAPAAASPLLLFANRRDVRLV 46 

iH" 1 



1 METAPT 



RAPPPPPPPLLLLV LYCSLVPAAASPLLLP 



47 DAGGVKLEST 

ill 



46 DAGGVKLEST 
97 AAVQNVV 
96 AAAQNIV 



VVSGLEDAAAVDFQFSKGAVYW 

I - M 1 1 M I M M I M f I / M ( 



V A S G L E D AAAV D F Q F S KG AV YW 



DVSEEAIKQTYLNQ' 



G 96 
G 95 



SGLVSPDGLACDWVGKKLYWTDSETNR 

I I I I I I I I I I I I I I I I i I I I I I I I I I I 
SGLVSPDGLACDWVGKKLYWTDSETNR 



EVANLNGTSRKVLF 146 

I I I !! I I I I I I I I! 

EVANLNGTSRKVLF 145 



147 WQDLDQPRAIALDPAHGYMYW 
III II I II II I II 



iiiiimi mi 

146 WQDLDQPRAIALDP 



DWGETPRIERAGMDGS 



I I I I • I I I I I I I I I I 

AHGYMYWTDWGEAPRIERAGMDGS 



RKIIVDSD 

I I I I I I I I 
RKIIVDSD 



YW 196 
I I 

YW 195 



197 PNGLTIDLEEQKLYWADAKLSFIHRANLDGSFRQKVVEGSLTHPFALTLS 246 

W 6 mmmmmkmmmmnkiTVs 2 *s 

247 GDTLYWTDWQTRSIHACNKRTGGKRKEILSALYSPMDIQVLSQERQPFFH 296 
246 GiTiWrWUIliW^T^QiMiaiiliWiliWii^WpW 295 
297 TRCEEDNGGCSHLCLLSPSEPFYTCACPTGVQLQDNGRTCKAGAEEVLLL 346 



296 

347 ARR 
I I I 

346 ARR 



DLRR 

I I I I 
DLRR 



SLDTPDF 

I I I I I I I 
SLDTPDF 



DIVLQVDD 
I I I I I I : I 



DIVLQVGDIRHAIA 



RHA 
I I 



AI DYD P L E GYVYWTDDE VRA 396 



,1111111... 
)YDPLEGYVYW 



4i 



DEVRA 3 95 



397 IRRAYLDGSGAQ - 

I I I I I I I I I JL I JL 

396 IRRAYLDGSGAQ 



LVNTE 



LVN 



4 E 



N D P DG I AVDWVARN L YW 

I I I I I I I I I I I t I 1. 1 I I 
NDPDGIAVDWVARNLYW 



DR 

I I 
DR 



EVTRLN 446 

41 



EV 



ILN 445 



447 GTSRK 



446 



LVSEDLDEPRA 



ALHPVMGLMYW 

•IIMIIIIII 
VLHPVMGLMYW 



DWGENPK 

MM 



ECANLDGQERRV 496 
ECANL.DGRDRHV 495 



497 LVNASLGWPNGLALDLQEGKLYWGDAKTDKIEV 



496 mmmmimk 



DK 



EV 



NVDG 



KRR 



LLEDKLP 546 



)KLP 545 



547 H 
I 

546 H 



FGF 



LLGDF 



YWTDWQRRS I E RVH KVKAS RD V IIDQLPDLMGL KAVNV 596 



597 AKVVGTNPCADRNGGCSHLCFFTPHATRCGCPIGLELLSDMKTCIVPEAF 646 
596 MGTWaDGiGGCsWc^T^MAc^siitaWWziiiW 645 



647 LVFTSRAAIHRISLE 

646 wwjuitimiit 



NNNDVAIPL 

lllllllll 
NNNDVAIPL 



GVKEASALDFDVSNNH 

LLLilllllllilLLI 



GVKEASALDFDVSNNH I YWTDVSL 695 



YWTDVSL 696 
II 
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FIG. 26B 

Alignment of Human and Mouse Zmaxl (LRP5) 
Amino-Acid Sequences. 



697 KT 
696 K 



SRAFMNGSSVEHVVEFGLDYPEGMAVDWMGKNLYWADTGTNRIEVAR 746 



747 LDGQFRQVLVWRDLDNPRSLALDP 

, r I I I I I I I I I I I I I I I I I I I I I I I I 
746 LDGQFRQVLVWRDLDNPRSLALDP 



797 M 
796 M 



LVDKVGRANDL 



IDYADQRLYW 
L V D K V G PA HOLT IDYADQRLYW' 



KGY 

DLD 

III 

DLD - 



YWTEWGGKPR 



"EWGGRPR 



VRAFMDGTNC 796 

I I I I I I I I I I 
VRAFMDGTNC 795 



NMIESSNMLGQERVVIADDLP 846 

I i I I I ( I f I I I I I : I I I ( I I I 
NMIESSNMLGQERMVIADDLP 845 



847 HPFGLTQYSDYIYWTDWNLHSIERADK 

• I I I I Mill.. 
846 YPFGLTQYSDYIYW 



"dwnlAsi eradk 



TSGRNR1 


rL: 


[QGHLDFVMD] 


CLVFH 


896 


TSGRNR1 


1 

rL] 


mm: 


1 1 1 1 

[LVFH 


895 



897 SSRQDGLNDCMHNNGQCGQLCLAIPGGHRCGCASHYTLDPSSRNCSPPTT 946 



947 FLLFSQKSA 

94 6 imm 



SRM 

OA 



PDDQHSPDLILPLHGLRNVKA 

llll I I I I :| I I I I I I I I I I 
PDDQLSPDLVLPLHGLRNVKA 



945 



DYDPLDKFIYWVDG 996 

:|IHIIIIIIIII 
NYDPLDKFIYWVDG 995 



997 RQN 
II 



KRAKDDGTQPFVL 
I I III II III -I 



996 RQN I KRAKDDGTQP SM L 



SLSQGQNPDRQPHDLS 
Ml. ilJMIIIM 



SPSQSLSPDRQPHDLSIDIYSR 



YSR 
I I 



LFWTCEA 



II I I I II I 
LFWTCEATN 1045 



N 1046 



1047 
1046 



NVHRLSGEAMGVVLRGDRDKPRAIVVNAERGYLYF 



nvhrldgdamgWlrgdrdW 



NMQDRAAK 



ERA 1096 
III 

ERA 1095 



1097 ALDGTEREVLFTTGLIRPVALVVDNTLGKLFWVDADLKRIESCDLSGANR 1146 

i 9 96 iUmmTrmmmLmmimmm 



1147 L 



1146 L 



LEDAN 



II. I llll 



LEDAN 



VQPLGL 



ILGKHLYWIDRQQQM 



CDLSGANR 1145 
GDKRTRIQGRVA 1196 



ERVEK 

1= RV E l<CTTG I) ^RTRVQG RVT 1195 



I I I : II : I I : I II I I I I I I I I 
[VQPVGLTVLGRHLYWIDRQQQM] 

1197 HLTGIHAVEEVSLEEFSAHPCARDNGGCSHICIAKGDGTPRCSCPVHLVL 1246 

. I I I II I I I I M I I I M I I I I I I I I I I I I I 111(1 
1196 HL1 



1247 LQNLL 



CGEPPTCSPDQFACA' 



i 2 46 WWWWWiWWrUI 



TGE] 


[DCI 


II 

TGE] 


.Ail 



PGAWRCDGFPECDDQSDEEGCP 1296 



1297 VCSAAQFPCARGQCVDLRLRCDGEADCQDRSDEADCDAICLPNQFRCASG 1346 

LI I ^1 11 1 ! 1 11 l.KJ I J I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 : 1 1 1 i I I J J 

CQDRSDEANCDAVCLPNQFRC 



1296 VCSASQFPCARGQCVDLRLRCDGEAD 
1347 QCVLIKQQCDSFPDCIDGSDELMCE 



1346 QcWlHCQQCDSPPDCAD 



mm 



TKPPSDDSPAHSSAIGPVIG 

III MUM III I 
SDDIPAHSSAIGPVIG 



NKPP5 



1397 LFVMGGVYFVCQRVVCQRYAGANGPFPHEYVSGTPHVPLNF 

1396 [miiWWAAAWAAItasiWWiWiiAWWUi 



TSG 1345 

LS 1396 

LS 1395 

APGGSQHG 1446 

AP AiiAAA 1445 
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FIG. 26C 

Alignment of Human and Mouse Zmaxl (LRP5) 
Amino-Acid Sequences. 



1447 
1446 
1497 
1496 
1547 
1546 
1597 
1596 



PFTGIACGKSMMSSVSLMGGRGGVPLYDRNHVTGASSSSSSS 



I I • I I : I : I I : II I : I I : I I I I : I I I I I I I I I 
PFPGIPCSKSVMSSMSLVGGRGSVPLYDRNHV 



MINIMI 
GASSSSSSS 



KA 



LYPP 1496 
MM 

LYPP 1495 



LNPPPSPA' 



DPSLYNMDMFYSSN 



PATARPYRPYI 



RGMAPP 



PCS 



D 1546 



VCDSDYSASRWKASKYYLDLNSDSDPYPPPPTPHSQYLSAEDSCPPSPA 

III 

VCD 



1596 



II Mill illl-IMMMMMMMMIIMMMIMMIIMI: 
/CDSDYSISRWKSSKYYLDLNSDSDPYPPPPTPHSQYLSAEDSCPPSPGT 1595 

ERSYFHLFPPPPSPCTDSS 1615 

1614 
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FIG. 27A 

Alignment of Human LRP5 and LRP6 
Top = LRP5, Bottom = LRP6 
Underlined = region marked for antibody production 
Percent Similarity: 83.542 Percent Identity: 70.901 



1 MEAAPPGPPWPLLLLLLLLLALCGCPAPAAASPLLLFANRRDVRLVDAGG 50 

: * : I llll:.. I • I I II : I I I II : I I I I I • • 
1 MGAVLRSLLA. CSFCVLLRAAPLLLYANRRDLRLVDATN 38 



51 VKLESTIVVSGLE DAAAVD F Q F S KGAVYWTD VS E E A 

I I : I I I I I I I I I I I I • I : I I • I I I I II 
VVGGLEDAAAVDFVFSHGLIYWSDVSEEA 



39 GKENAT 



KQTYLNQTGAAVQ 100 

I- I :|.|: -II 
KRTEFNKTE . SVQ 87 



101 NVVISGLVSPDGLACDWVGKKLYWTDSETNRIEVANLNGTSRKVLFWQDL 150 
88 ^iti^iWiiEUWliiWAlWsiibisLiWiWli 137 



ALDP AHGYMYW 



DWGETPR 



ERAGM DGSTRKIIVDSDIYWPNGL 200 



151 dqpra: 

I I I I I llll- I : I 1 1 I I I I : I : I I I I I 1 1 I • I I : : I : 
138 DQPRAIALDPSSGFMYWTDWGEVPKIERAGMDGSSRFIIINSE 



YWPNGL 187 



201 
188 



IDLEEQKLYWADAKLSFIHRANLDGSFRQKVVEGSLTHPFALTLSGDTL 250 



251 YW 
238 YW' 



DWQ' 
DWS' 



RS 
Hi 



HACNKR 
III 



LACNKYTGEGLR 



GGKRKE 
I: . 11 



LSALYSPMDIQVLSQERQPFFHTRCE 300 

III :.:||:IM -I: 
DIFSPMDIHAFSQQRQPNATNPCG 287 



301 E DNGGCSHLCLLSPSEPFYTCA CPTGVOLODNGRTCKAGAEEVLLLARR 



288 IPNGGCS 

! SLDTPDFTDI VLQVDD 



.111:11 -III I . 
HLCLMSPVKPFYQCACP 



GVKLIE 



350 
337 



351 DLRR 
338 Dll.RR 



mm 



RHA 

I I I 
RHA 



D YD P L E GYVYWTDD E VRA 
II 



DYDPVEGYIYW 



401 YLDGSGAOTLVNTE ] 


[NDPDGJ 


[ AVDWVARN L YW" 


rD" 


rG* 


rDR] 


[EVT 


:: 1 1 1 I - J : 1 • • : 
388 FIDGSGSQFWTAQ] 


••III 
[AHPDG] 


1 1 1 1 1 1 1 1 1 1 1 
[AVDWVARN LYW1 


1 

rDi 


1 

rGi 


1 1 

rDR] 





451 K 
438 l< 



LVSEDLDEPRA 



ALHPVMGLMYWTDWGENPK 



RRA 400 
RRS 387 
SR 450 
rUgImA 437 



ECANLDGQERRVLVNA 500 



DDEV 
RLNG 



LiSEDLEEPRAIVLDPMVGYMY^ 487 
JGLALDLQEGKLYWGDAKTDKIEVINVDGTKRRTLLEDKLPH 



DKIEVINVDG 

II I I : I • I I 
EVMNTDG 



DK 



FG 550 
GRRVLVEDKI PHlFG 537 



551 FTLLGDFIYWTDWQRRSIERVHKVKASRDV 

I I I I I I :: I I I I I I 1 1 I I I I I I • I • I : I 
538 FTLLGDYVYWTDWQRRSIERVHKRSAEREV 



DQLPDLMGLKAVNVAKVV 600 

lllllllllll.il :|: 
DQLPDLMGLKATNVHRVI 587 



601 GTNPCADRNGGCSHLCFFTPHATRCGCP 



588 GSNPCAEEl!lGGCSHLCLYRPQGLRCACPIGFE^ 



GLELLSDMK 



CIVPEAFLVFT 650 

I I I I I I : I • 
VPEAFLLFS 637 
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FIG. 27B 

Alignment of Human LRP5 and LRP6 



651 


SRAA] 


[HR] 


[SLE1 


rNNNDVA] 


[PLT 


638 


•II- 
RRAD] 


: 1 
[RR] 


1 1 1 

[SLE1 


1 1 i : 1 1 
fNNNNVA] 


■Air 



GVKEASALDFDVSNNH 



YWTDVSLK 



701 RAFMNGSSVEHVVEFGLDYPEGMAVDWMGKNLYWAD 

68 8 miLmmmmMmki, 



DVTDNRIYW 

G 
I 

G 



NR 

I I 
NR 



S 700 

III I 

DISLKTIS 687 

EVARLDGQ 750 
737 



751 FRQVLVWRDLDNPRSLALDPTKGYIYW 



7B8 HiiWWkWAs^iJliWAE^HW!^iWklD^iisiRTT l , 



EWGGKPRIVRAFMDGTNCM 



IV 800 



801 DKVGRANDLTIDYADQRLYWTDLD 
788 PNVi 
851 L 
838 L 



LV 787 



NMIESSNMLGQERVVIADDLPHPFG 850 



TQYSDYIYW1 


"DWNLH5: 


[ERADK1 


rSGRNRl 


-|_: 


[QGHLDFVMD] 
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TQYQDYIYW1 


*di!/srrs: 




11-11 
rSGQNRl 


"i: 


1 1 1 1 1 : 1 1 1 

[QGHLDYVMD] 



ADDLPflPFG 837 

LVFHSSRQ 900 

llllllll 
LVFHSSRQ 887 



901 DGLNDCMHNNGOCGOLCLAIP . GGHRCGCASHYTLDPSSRNC SPPTTFLL 949 
888 SGWtllECASSNGHCSHLCLAVPVGGFVCGCPAHisL^ 937 



950 FSQKSA 
938 FSQKSA 



SRMIPDDQHSPDL 
NrAviAeQQSPDI I L P I HSLRNV 



LPLHGLRNVKAIDYDPLDKFIYWVDGRQN 999 



JliJ.iJU.UI IJIIIII :||:|:||| 

RAIDYDPLDKQLYWIDSRQN 987 



1000 



988 M 
1048 
1038 



KRAKDDGTOPF . VLTSLSOGO NPDROPHDLS 



RKAQEDGSQGFTVWSSVPSQlilLEIQPYDLsU 



YSRTLFWTCEA 



NVHRLSGEAMGVVLRGDRDKPRAIVVNAERGYLYF 



NVTRLDGRSVGVVLkGEQDRPRAIVVNPEKGY^ 1087 



NMQDRAAK 



NT 1047 
I • 

NV 1037 
ERAA 1097 



1098 LDGTEREVLFTTGLIRPVALVVDNTLGKLFWVDADLKRIESCDLSGANRL 1147 

I I I I I I I I I I -II : I : I I • : I . I I I I I I - I • I I : I 
1088 ldgterevlffsglskpialaldsrlgklfwadsdlrr: 



1148 TLEDANIVQPLGL 



1138 VLEDSN 



ILGKHLYWIDRQQQM 



LQPVGLivFENWLWlDKQQQA 



ERVEK' 

|:::. 

EKIDM' 



SSDLSGAlllRi 1137 

TGDKRTRI QGRVAH 1197 
. : | : : | : | : | : 

GREGRTKVQARIAQ 1187 



1198 LTG 

1188 isDI HAVKE L M LQE YRQH PCAQDNGG C SH 



HAVE E VS L E E F SAHPCARDNGGC SH 

I I • I : • I : I : • • I I I I • I I I I I I I 
"TAQDNG 



CIAKGDG 



CLVKG 



AA 



PRCSCPVHLVLL 1247 

• I I I I I : I I I I I 
TRCSCPMHLVLL 1237 



1248 QN.LLTCGEPP 

1238 AAeAsAAAAAIA 



CSP DOFACATGE 



I I : I I • I III 
SPQQFTCFTGE 



DCIPGAWRCDG FPECDDOSDEEGCPV 1297 

AAiAvIAAAAAAtAAeA^AAAlnAAA 128 7 



1298 CS AAOFPCARGOCVDLRLRCDGFA DrODRSDFADrDATf I PNOFRCASKO 1347 



1288 



CSESQFQCAsMciDGALRCNGDANMDKSDEK^^ 



1337 



1348 CVLIKOOCDSFPDCIDGSDFI MP FT 

I : ••■II -11111111 
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KPPSDDSPAHSSAIGPVIGIILSL 1397 

..| .1. ...:|.| 1: :.: 

E E P. . .APQATNTVGSVIGVIVTI 1384 
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FIG. 27C 

Alignment of Human LRP5 and LRP6 
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Model of Wnt signaling 



FIG. 28 

SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 02092000A2_I_> 



WO 02/092000 



PCT/USO 2/1 4877 



55/89 




o 

CO 




'r <? "V <r ^ j / ^ 





3 
B 



3 
B 



.-J 



© 



»/-> 

I-}. 



© 



t-4 



la 

+ + 



+ + 



i 

• r— ( 

I 1 



CO 

13 > 

S o 

o o 

'I '5 

is B 



1— I .• \ ,_j 

+ §>f 

OX) 



I 

o 



> 



o 

» f— ( 

I 



o 

> o 

.a « 



a 
o 



8 



o 



o 

CM 



O 
O 



o o o o o 
. ^ ^_ °° <o c\j M 

(roti[-Bnni9-JI / ptq- jox jo oiif^) f, 



uoipnpui xotii-jox 



o 

(2 



1 § 



CM 

O 

LL 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCIO: <WO 02092000A2_I_> 



WO 02/092000 



PCT/US02/14877 



56/89 



CO 

O 
O 



•a da 

i s 



Oh 

o 

s 



sfs'sff'tT* ■■■ /■ «■" ^« J'fffJ'l 
• 5>?>I>5>5> 5> - s 1>5> t> t> ?> 1> ■» 
■■>■>■>•>•> ■■■ »s •% ■%■% ■_■» »s«s ■■ 



**- ■- ■- ■- ■- -- -- ■- ■- ■- ■- 





00 

s 



IT) 




> 

00 * 



B 



h-3 



+ 



00 

e 
w 



V) 



0 

+ 



> 

00 A 

s 
a 

m 
i-3 



+ 



o 



CO 

O 
Oh 
t-i 
O 

s 



C/3 
CO 

O 
CD 



22 m 



oo 



CO 



CD 

— i' Os 
M P 
^ O 

CO 

12 ° 

3 .co 

►9 « 



o o oooo oo o o 

OOCD^-CNOCOCO^CM w 

t- t- t- t- <r- g 

(ioni-Bnixi9^[ / ptit-.jox jo opra) « 

uoipnpur loni-jDX § 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO. 



.020920O0A2J_> 



WO 02/092000 



57/89 



PCT/US02/14877 



Gene 


Genbank 
Accession # 


Protein 
Accession # 


granulin 


M75161 


AAA58617 


similar to cys/His rich protein 


BC004544 


AAH04544 


IGF-BINDING PROTEIN 6 


M69054 


AAA88070 


latent TGFb binding protein 4 


AF051344 


AAC39879 


NOTCH 2 


AF315356 


AAG37073 


fibulin 1 


X53743 


CAA37772 


MDC15(ADAM15) 


! U46005 
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DKFZp761G02121(notch1 Ca++ binding like) 


AL137311 


CAB70690 


chordin 


AF076612 


AAC69835 


fibronectin 1 


U42594 
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MG50(melanoma associated antigen) 


| AF200348 


AAF06354 


unknown (notch 4-like) 


AX068260 


CAC27245 


Slit 1 


AB017167 


BAA35184 


tomoregulin (agarin repeat homology) 


AB004064 


BAA90820 


sprouty 1 
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AAC39566 


sprouty 2 


AF039843 


AAC04258 


NOV1 


X96584 


CAA65403 


agrin 


AF016903 


AAC39776 


fibrillin 1 


L13923 


AAB02036 


thrombospondinl 


X04665 


CAA28370 


ADAM 19 


AF134707 


AAF22162 


Nafl alpha 


AJ011895 


CAA09855 


laminin alpha 5 


Z95636 


CAB09137 


CRIM1 


AF167706 


AAF34409 


nidogen 


M30269 


AAA59932 


fibulin-2 


X82494 


CAA57876 


thrombospondin 2 


L12350 


AAA03703 


\ KIAA1323 


AB037744 


BAA92561 ! 


fihrillin-9 
IIUI lllllr£ 


1 I/T3070 


AAA! oyoU 


MEGF9 


AB011542 


BAA32470 I 


integrin beta 1 


X07979 


CAA30790 I 


matrilin-2 precursor 


U69263 


AAC51260 


tenascin 


X56160 


A32160 
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zmaxLBD 
1,4 

screen X 

peptide 

library 



Peptide Number 


Peptide Sequence 


No. of Hits 


9 


VVLCSRCGRLWRWSCG 


1 


12 


EVRQVTCIRCRRGFLL 


1 


13 


GGGGMWEAWSCYACG 


1 


14 


GWRWCGRCGALWWRRV 


3 
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Listed are the pcDNA3.1 construct names followed by the DNA 
sequence 

OST258 (control for OST 259-OST262 and OST264 , OST265) 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTT C CAGGTT CC ACTGGTGACGGATC C 

OST2 5 9 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTT CTGGGC AGAGTGGTGCGGTC CGAATTC CGTGGTT CTGTGTT CGCGTTG 

TGGGCGTTTGTGGCGGTGGTCGTGTGGGACTAGTGGTCCGTGCAAAATGATCG 

CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 

AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 

TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 

GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST2 60 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGGGTGGCGGTGGTGTGGTCG 
GTGTGGGGCTTTGTGGTGGCGGCGTGTTACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST261 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGAGGTGCGGCAGGTTACGTG 
TATTAGGTGTCGTCGGGGTTTTCTGTTGACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 

FIG. 39A 
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AAACTGAACATCGATCAAAA.CCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAZ^AGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST262 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGGTGGTGGGGGGATGATTTG 
GGAGGCTTGGAGTTGTTATGCGTGTGGGACTAGTGGTCCGTGCAAAA.TGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST2 63 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCATGAGCGATAAAATTATTCACCTGA 
CTGACGACAGTTTTGACACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTC 
GATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCTTGTGGATTGGGCCGGGTGA 
TCAGGGTCTGTTTCGGCGTTTTGTTTTTACTAGTGGTCCGTGCAAAATGATCG 
CCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCA 
AAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGG 
TATCCCGACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGG 
GTGCACTGTCTAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAA 
GCGGCCGC 

OST264 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCGTGTCTTCTGATCAAAATCATTTCC 
GAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACC 
TTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGT 
GTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGT 
CAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCA 
GCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATC 
AAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAAGCGGCCGC 

FIG. 39B 
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OST265 



AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACTGGTGACGGATCCTGCGCTAGTCCCACCCGCGGAGGGG 
ACGCGGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATG 
CGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTC 
TTCTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCT 
TTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCT 
TCAAAAATGTATCACACCAAAGGACAAGAA.GGTTCTGTTTGTCTCCGGTCATC 
AGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTA 
AACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCT 
CATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCTA 
AGCGGCCGC 



AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 
CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 
GGTGCGGTCCGAATTCCTATGCGTGGTTGTTTTCTTGTAGTAGGTGTAGGTGG 
TGGTTGCCTTGGACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGA 
AATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAA.CATCGATC 
AAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACTCTGCTG 
CTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGG 
TCAGTTGAAA.GAGTT C CTCGACGCTAACCTGGCGTAAGCGGC CGC 



AAGCTTGC C AC CATGGGCGATAAAATTATTCAC CTGACTGACGACAGTTTTGA 
C ACGGATGTACT C AAAGCGGACGGGGCGATC CT CGT CGATTT CTGGGC AGAGT 
GGTGCGGTC CGAATT C CATTTGTGAGGTTGTGAGGTTGTGGAGTCGGT AT C CT 
TGGT CTTGGGTGACTAGTGGT C CGTGC AAAATGATCGCC C CGATT CTGGATGA 
AATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAACATCGATC 
AAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACTCTGCTG 
CTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGG 
TCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 



OST2 66 



OST2 67 
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OST268 

AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 
CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 
GGTGCGGTCCGAATTCCGGTTGTACTAGTGCGGTGTGTGGTGCTTGGGCTGAG 
GCGGGTAGGTTTTATTGTACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCT 
GGATGAAATCGCTGACGAATATCAGGGCAAACTGACCGTTGCAAAACTGAACA 
TCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACT 
CTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTC 
TAAAGGTCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 

OST269 (irrelevant control peptide for OST266-OST268) 

AAGCTTGCCACCATGGGCGATAAAATTATTCACCTGACTGACGACAGTTTTGA 

CACGGATGTACTCAAAGCGGACGGGGCGATCCTCGTCGATTTCTGGGCAGAGT 

GGTGCGGTCCGAATTCCTTGTGGATTGGGCCGGGTGATCAGGGTCTGTTTCGG 

CGTTTTGTTTTTACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGA 

AATCGCTGACGAATAT CAGGGCAAACTGAC CGTTGC AAAACTGAACAT CGATC 

AAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTATCCCGACTCTGCTG 

CTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGG 

TCAGTTGAAAGAGTTCCTCGACGCTAACCTGGCGTAAGCGGCCGC 



FIG. 39D 
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Listed below are the amino acid sequences corresponding to the 
pcDNA3.1 constructs in Appendix 1A 

OST258 

METDTLLLWVLLLWVPGSTGDGS 
OST25 9 

METDTLLLWLLLWPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA 
EWCGPNSVVLCSRCGRLWRWSCGTSGPCKMIAPILDEIADEYQGKLTVAKLNI 
DQNPGTAPKYGI RGI PTLLLFKNGEVAATKVGALSKGQLKEFLDANLA 

OST2 6 0 

METDTLLLWVLLLWVPGSTGDGSMSDKI IHLTDDSFDTDVLKADGAILVDFWA 
EWCGPNSGWRWCGRCGALWWRRVTSGPCKMIAPILDEIADEYQGKLTVAKLNI 
DQNPGTAPKYGI RG I PTLLLFKNGE VAATKVGALS KGQLKE FLDANLA 

OST2 61 

METDTLLLWVLLLWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGAILVDFWA 
EWCGPNSEVRQVTCIRCRRGFLLTSGPCKMIAPILDEIADEYQGKLTVAKLNI 
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA 

OST2 62 

METDTLLLWVLLLWVPGSTGDGSMSDKI IHLTDDS FDTDVLKADGAI LVDFWA 
EWCGPNSGGGGMI WEAWS CYACGTSGPCKMI AP I LDE I ADEYQGKLTVAKLMI 
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA 

OST263 

METDTLLLWVLLLWVPGSTGDGSMSDKI IHLTDDSFDTDVLKADGAILVDFWA 
EWCGPNSLWIGPGDQGLFRRFVFTSGPCKMIAPILDEIADEYQGKLTVAKLNI 
DQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA 

OST264 

METDTLLLWVLLLWVPGS TGDGSVS SDQNHFRGE IEETI TES FGNDHSTLDGY 
SRRTTLSSKMYHTKGQEGSVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCT 
KHRRKGSHGLE I FQRC YCGEGLS CRI QKDHHQASNS SRLHTCQRH 



FIG. 44A 
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OST265 

METDTLLLWVLLLWVPGSTGDGSCASPTRGGDAGVQICIjACRKRRKRCMRHAM 
CCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMY 
HTKGQEGSVCLRS SDCASGLCCARHFWSKI CKPVLKEGQVCTKHRRKGSHGLE 
IFQRCYCGEGLSC. 

OST266 

MGDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPNSYAWLFSCSRCRWWLPW 
TSGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLLLFKN 
GE VAATKVGALS KGQLKE FLDANLA 

OST267 

MGDKI IHLTDDS FDTDVLKADGAI LVDFWAE WCGPNS I CE WRLWSRYPWSWV 
TS G PCKM I AP I LDE I ADE YQGKLT VAKLNI DQNPGTAP KYG I RG I PTLLL FKN 
GE VAATKVGALS KGQLKE FLDANLA 

OST26 8 

MGDKI IHLTDDSFDTDVLKADGAILVDFWAEWCGPNSGCTSAVCGAWAEAGRF 
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SEQUENCE LISTING 

<110> Allen, Kristina M. 
Yaworsky, Paul 
Morales, Arturo J. 
Graham, James R. 
Anisowicz, Anthony 
Liu, Wei 

<120> HBM Variants that Modulate Bone Mass and Lipid Levels 

<130> 032796-135 

<150> US 60/290,071 
<151> 2001-05-11 

<150> US 60/291,311 
<151> 2001-05-17 

<150> US 60/353,058 
<151> 2002-02-01 

<150> US 60/361,293 
<151> 2002-03-04 

<160> 765 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 5120 

<212> DNA 

<213> Homo sapiens 

<400> 1 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
15 10 
ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 

15 20 25 

gcg gcc teg ccg etc ctg eta ttt gcc aac cgc egg gac gta egg ctg 205 
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gcc ggc gga gtc aag ctg gag tec ace ate gtg gtc age ggc 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly 

50 55 60 

ctg gag gat gcg gcc gca gtg gac ttc cag ttt tec aag gga gcc gtg 301 
Leu Glu Asp Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val 

65 70 75 

tac tgg aca gac gtg age gag gag gcc ate aag cag ace tac ctg aac 349 
Tyr Trp Thr Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn 
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370 375 380 

ccg eta gag ggc tat gtc tac tgg aca gat gac gag gtg egg gec ate 12 61 

Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie 

385 390 395 

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 13 09 

Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr 

400 405 410 

gag ate aac gac ccc gat ggc ate gcg gtc gac tgg gtg gee cga aac 1357 
Glu He Asn Asp Pro Asp Gly He Ala Val Asp Trp Val Ala Arg Asn 

415 420 425 

etc tac tgg acc gac acg ggc acg gac cgc ate gag gtg acg cgc etc 14 05 

Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu 
430 435 440 445 

aac ggc acc tec cgc aag ate ctg gtg teg gag gac ctg gac gag ccc 1453 
Asn Gly Thr Ser Arg Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro 

450 455 460 

cga gec ate gca ctg cac ccc gtg atg ggc etc atg tac tgg aca gac 15 01 

Arg Ala He Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp 

465 470 475 

tgg gga gag aac cct aaa ate gag tgt gee aac ttg gat ggg cag gag 1549 
Trp Gly Glu Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu 

480 485 490 

egg cgt gtg ctg gtc aat gee tec etc ggg tgg ccc aac ggc ctg gee 1597 
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gec aag aca gac 
Leu Asp Leu Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp 
510 515 520 525 

aag ate gag gtg ate aat gtt gat ggg acg aag agg egg acc etc ctg 1693 
Lys He Glu Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 
Glu Asp Lys Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

ate tac tgg act gac tgg cag cgc cgc age ate gag egg gtg cac aag 1789 
He Tyr Trp Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys 

560 565 570 

gtc aag gee age egg gac gtc ate att gac cag ctg ccc gac ctg atg 1837 
Val Lys Ala Ser Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met 

575 580 585 

ggg etc aaa get gtg aat gtg gec aag gtc gtc gga acc aac ccg tgt 1885 
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 
590 595 600 605 

gcg gac agg aac ggg ggg tgc age cac ctg tgc ttc ttc aca ccc cac 193 3 

Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 

610 615 620 

gca acc egg tgt ggc tgc ccc ate ggc ctg gag ctg ctg agt gac atg 1981 
Ala Thr Arg Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met 

625 630 635 

aag acc tgc ate gtg cct gag gee ttc ttg gtc ttc acc age aga gec 2029 
Lys Thr Cys He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala 

640 645 650 

gec ate cac agg ate tec etc gag acc aat aac aac gac gtg gec ate 2 077 

Ala He His Arg He Ser Leu Glu Thr Asn Asn Asn Asp Val Ala He 
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Asn 


Gly 


Pro 












1410 








1415 








1420 




ttc 


ccg 


cac 


gag 


tat 


gtc 


age 


ggg 


acc 


ccg 


cac 


gtg 


ccc 


etc 


aat 


ttc 


4381 


Phe 


Pro 


His 


Glu 


Tyr 


Val 


Ser 


Gly 


Thr 


Pro 


His 


Val 


Pro 


Leu 


Asn 


Phe 










1425 








1430 








1435 
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gee 


ccg 


ggc 


ggt 


tec 


cag 


cat 


ggc 


ccc 


ttc 


aca 


ggc 


ate 


gca 


tgc 


4429 


He 
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Pro 


Gly 


Gly 
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Gin 


His 


Gly 


Pro 


Phe 


Thr 
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He 


Ala 


Cys 








1440 








1445 








1450 








gga 


aag 


tec 


atg 


atg 


age 


tec 


gtg 


age 


ctg 


atg 


ggg 


ggc 


egg 


ggc 


ggg 


4477 


Gly 


Lys 


Ser 


Met 


Met 


Ser 


Ser 


Val 


Ser 


Leu 


Met 


Gly 


Gly 


Arg 


Gly 


Gly 






1455 








1460 








1465 










gtg 


ccc 


etc 


tac 


gac 


egg 


aac 


cac 


gtc 


aca 


ggg 


gec 


teg 


tec 


age 


age 


4525 
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Pro 


Leu 


Tyr 


Asp 


Arg 


Asn 


His 


val 


Thr 


Gly 


Ala 


Ser 


Ser 


Ser 


Ser 




1470 








1475 








1480 








1485 




teg 


tec 


age 


acg 


aag 


gee 


acg 


ctg 


tac 


ccg 


ccg 


ate 


ctg 


aac 


ccg 


ccg 


4573 


Ser 


Ser 


Ser 


Thr 


Lys 


Ala 


Thr 


Leu 


Tyr 


Pro 


Pro 


He 


Leu 


Asn 


Pro 


Pro 












1490 








1495 








1500 




ccc 


tec 


ccg 


gec 


acg 


gac 
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tec 


ctg 


tac 


aac 


atg 


gac 


atg 


ttc 


tac 


4621 
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Ser 


Pro 


Ala 


Thr 


Asp 


Pro 


Ser 


Leu 


Tyr 


Asn 


Met 


Asp 


Met 


Phe 


Tyr 










1505 








1510 
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tct 


tea 


aac 


att 


ccg 


gee 


act 


gcg 


aga 


ccg 


tac 


agg 


ccc 


tac 


ate 


att 


4669 


Ser 


Ser 


Asn 


He 


Pro 


Ala 


Thr 


Ala 


Arg 


Pro 


Tyr 


Arg 


Pro 


Tyr 


He 


He 
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1520 1525 1530 

cga gga atg gcg ccc ccg acg acg ccc tgc age acc gac gtg tgt gac 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gac tac age gee age cgc tgg aag gee age aag tac tac ctg gat 4765 
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gee acc gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965 
Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5 085 

tggagaaata tttataaact taattttgta aaaca 5120 

<210> 2 

<211> 5120 

<212> DNA 

<213> Homo sapiens 

<400> 2 

actaaagege cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
15 10 
ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gee ccc gec 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 

15 20 25 

gcg gec teg ccg etc ctg eta ttt gee aac cgc egg gac gta egg ctg 
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gec ggc gga gtc aag ctg gag tec acc ate gtg gtc age ggc 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly 

50 55 60 

ctg gag gat gcg gee gca gtg gac ttc cag ttt tec aag gga gec gtg 3 01 

Leu Glu Asp Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val 

65 70 75 

tac tgg aca gac gtg age gag gag gec ate aag cag acc tac ctg aac 
Tyr Trp Thr Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn 

80 85 90 

cag acg ggg gee gee gtg cag aac gtg gtc ate tec ggc ctg gtc tct 3 97 

Gin Thr Gly Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser 

95 100 105 

ccc gac ggc etc gee tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445 
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr 
110 115 120 125 



205 



349 
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gac 


tea 


gag 


acc 


aac 


Asp 


Ser 


Glu 


Thr 
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egg 
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gtg 


etc 
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Arg 


Lys 


Val 


Leu 


Phe 
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ttg 


gac 


ccc 


get 


cac 


Leu 


Asp 


Pro 
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His 
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ccc 


egg 


att 


gag 


egg 


Pro 


Arg 


He 


Glu 


Arg 
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gtg 


gac 


teg 


gac 


att 


Val 


Asp 


Ser Asp 


He 


190 










gag 


cag 


aag 


etc 


tac 


Glu 


Gin 


Lys 


Leu 


Tyr 










210 


gec 


aac 


ctg 


gac 


ggc 


Ala 


Asn 


Leu Asp 


Gly 
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acg 


cac 


ccc 


ttc 


gee 


Thr 


His 


Pro 


Phe 


Ala 
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gac 


tgg 


cag 


acc 


cgc 


Asp 


Trp 


Gin 


Thr 


Arg 
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aag 


agg 


aag 


gag 


ate 


Lys 


Arg 


Lys 


Glu 


He 


270 










gtg 


ctg 


age 


cag 


gag 


Val 


Leu 


Ser 


Gin 


Glu 










290 


gac 


aat 


ggc 


ggc 


tgc 


Asp 


Asn 


Gly Gly 


Cys 
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ttc 


tac 


aca 


tgc 


gee 


Phe 


Tyr 


Thr 


Cys 


Ala 






320 






agg 


acg 


tgt 


aag 


gca 


Arg 


Thr 


Cys 


Lys 


Ala 




335 
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gac 


eta 
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agg 


Thr 


Asp 


Leu 


Arg 


Arg 
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gtg 


ctg 


cag 


gtg 


gac 


Val 


Leu 


Gin 


Val 


Asp 










370 


ccg 


eta 


gag 


ggc 


tat 


Pro 


Leu 


Glu Gly 


Tyr 








385 




cgc 


agg 


gcg 


tac 


ctg 


Arg 


Arg 


Ala 


Tyr 


Leu 



400 



cgc ate gag gtg gee 
Arg He Glu Val Ala 
135 

tgg cag gac ctt gac 
Trp Gin Asp Leu Asp 
150 

ggg tac atg tac tgg 
Gly Tyr Met Tyr Trp 
165 

gca ggg atg gat ggc 
Ala Gly Met Asp Gly 
180 

tac tgg ccc aat gga 
Tyr Trp Pro Asn Gly 
195 

tgg get gac gee aag 
Trp Ala Asp Ala Lys 
215 

teg ttc egg cag aag 
Ser Phe Arg Gin Lys 
230 

ctg acg etc tec ggg 
Leu Thr Leu Ser Gly 
245 

tec ate cat gee tgc 
Ser He His Ala Cys 
260 

ctg agt gee etc tac 
Leu Ser Ala Leu Tyr 
275 

egg cag cct ttc ttc 
Arg Gin Pro Phe Phe 
295 

tec cac ctg tgc ctg 
Ser His Leu Cys Leu 
310 

tgc ccc acg ggt gtg 
Cys Pro Thr Gly Val 
325 

gga gee gag gag gtg 
Gly Ala Glu Glu Val 
340 

ate teg ctg gac acg 
He Ser Leu Asp Thr 
355 

gac ate egg cac gee 
Asp He Arg His Ala 
375 

gtc tac tgg aca gat 
Val Tyr Trp Thr Asp 
390 

gac ggg tct ggg gcg 
Asp Gly Ser Gly Ala 
405 



aac 


etc 


aat 


ggc 


Asn 


Leu 


Asn 


Gly 


cag 


ccg 


agg 


gec 


Gin 


Pro 


Arg 


Ala 








155 


aca 


gac 


tgg 


gtt 


Thr 


Asp 


Trp 


Val 
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age 


acc 


egg 


aag 


Ser 


mr 


Arg 


Lys 
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ctg 


acc 


ate 


gac 


Leu 


Thr 


He 


Asp 


200 








etc 


age 


ttc 


ate 


Leu 


Ser 


Phe 


He 


gtg 


gtg 


gag 


ggc 


Val 


Val 


Glu 


Gly 








235 


gac 


act 


ctg 


tac 


Asp 


Thr 


Leu 


Tyr 






250 




aac 


aag 


cgc 


act 


Asn 
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tea 


ccc 


atg 


gac 


Ser 


Pro 


Met 


Asp 


280 








cac 


act 


cgc 


tgt 


His 


Thr 


Arg 


Cys 


ctg 


tec 


cca 


age 


Leu 


Ser 


Pro 


Ser 








315 


cag 


ctg 


cag 


gac 


Gin 


Leu 


Gin 


Asp 






330 




ctg 


ctg 


ctg 


gee 


Leu 


ucu 




Ala 




345 






ccg 


gac 


ttc 


acc 


Pro 


Asp 


Phe 


Thr 


360 








att 


gee 


ate 


gac 


He 


Ala 


He 


Asp 


gac 


gag 


gtg 


egg 


Asp 


Glu 


Val 


Arg 








395 


cag 


acg 


ctg 


gtc 


Gin 


Thr 


Leu 


Val 






410 





aca tec 493 

Thr Ser 

140 

ate gee 541 
He Ala 

gag acg 5 89 

Glu Thr 

ate att 637 
He He 

ctg gag 6 85 

Leu Glu 
205 

cac cgt 73 3 

His Arg 

220 

age ctg 7 81 

Ser Leu 

tgg aca 82 9 

Trp Thr 

ggg ggg 877 

Gly Gly 

ate cag 925 
He Gin 
285 

gag gag 973 

Glu Glu 

300 

gag cct 1021 
Glu Pro 

aac ggc 1069 
Asn Gly 

egg egg 1117 
Arg Arg 

gac ate 1165 
Asp He 
365 

tac gac 1213 

Tyr Asp 

380 

gec ate 1261 
Ala He 

aac acc 1309 
Asn Thr 
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ate 


aac 


gac 
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gat 


ggc 


ate 


gcg 


gtc 


Glu 
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Pro 
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Gly 


He 


Ala 


Val 




415 










420 








etc 


tac 


tgg 


acc 


gac 


acg 


ggc 


acg 


gac 


cgc 


Leu 


Tyr 


Trp 


Thr 


Asp 


Thr 


Gly 


Thr 


Asp 


Arg 


430 
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aac 


ggc 


acc 


tec 


cgc 


aag 


ate 


ctg 


gtg 


teg 


Asn 


Gly Thr 


Ser 


Arg 


Lys 


He 


Leu 


Val 


Ser 










450 










455 


cga 


gee 


ate 


gca 


ctg 


cac 


ccc 


gtg 


atg 


ggc 


Arg 


Ala 


lie 


Ala 


Leu 


His 


Pro 


Val 


Met 


Gly 








465 










470 




tgg 


gga 


gag 


aac 


cct 


aaa 


ate 


gag 


tgt 


gec 


Trp 


Gly Glu Asn 


Pro 


Lys 


He 


Glu 


Cys 


Ala 






480 










485 






egg 


cat 

ZD 


Qtq 

Z3 ZD 


ctg 


gtc 


aat 


gee 


tec 


etc 


ggg 


Arg 


Arq 


Val 


Leu 


Val 


Asn 


Ala 


Ser 


Leu 


Gly 




495 










500 








ctg 


aac 

ZJ 


ctg 


cag 


gag 


ggg 


aag 


etc 


tac 


tgg 


Leu 


Asp 


Leu 


Gin 


Glu Gly Lys 


Leu 


Tyr 


Trp 


510 










515 










aag 


ate 


gag 


QtQ 


ate 


aat 


gtt 


gat 


ggg 


acg 


Lys 


lie 


Glu 


Val 


lie 


Asn 


Val 


Asp 


Gly Thr 










530 










535 


gag 


gac 


aag 


etc 


ccg 


cac 


att 


ttc 


ggg 


ttc 


Glu 


Asp 


Lys 


Leu 


Pro 


His 


He 


Phe 


Gly 


Phe 








545 










550 




ate 


tac 


tgg 


act 


gac 


tgg 


cag 


cgc 


cgc 


age 


lie 


Tyr 


Trp 


Thr 


Asp 


Trp 


Gin Arg 


Arg 


Ser 






560 










565 






gtc 


aag 


gec 


age 


egg 


gac 


gtc 


ate 


att 


gac 


Val 


Lys 


Ala 


Ser 


Arg Asp 


Val 


He 


He 


Asp 
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get 


gtg 


aat 


gtg 


gee 


aag 


gtc 


Gly 


Leu 


Lys 


Ala 


Val 


Asn 


Val 


Ala 


Lys 


Val 
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agg 
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ggg 


ggg 
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age 
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ctg 
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Asn 


Gly Gly Cys 
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His 


Leu 
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615 
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egg 


tgt 


ggc 


tgc 
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ate 
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ctg 
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Thr Arg 
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Gly Cys 


Pro 


He 


Gly 


Leu 
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gtg 
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ttc 
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Thr 
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Phe 


Leu 
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ate 
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etc 


gag 


acc 


aat 
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lie 


His 


Arg 


He 


Ser 


Leu 


Glu 


Thr 


Asn 
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660 








ccg 
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acg 


ggc 


gtc 


aag 


gag 


gec 


tea 


gee 


Pro 


Leu 


Thr 
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Val 


Lys 


Glu 
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Ala 


670 
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age 
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Trp 


Thr Asp 


Val 
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gac 


tgg 


gtg 


gec 
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aac 
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Asp 


Trp 


Val 
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ate 


gag 
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1405 


He 


Glu 


Val 


Thr 
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Leu 
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gac 


ctg 


gac 


gag 


ccc 


1453 


Glu 


Asp 


Leu 


Asp 


Glu 


Pro 
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etc 


atg 


tac 


tgg 


aca 


gac 


1501 


Leu 


Met 


Tyr 


Trp 


Thr 


Asp 
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aac 


ttg 


gat 


ggg 


cag 


gag 


1549 


Asn 


Leu 


Asp 


Gly 


Gin 


Glu 
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tgg 


ccc 


aac 


ggc 


ctg 


gee 


1597 


Trp 


Pro 


Asn 


Gly 


Leu 


Ala 






505 












gga 


gac 


gec 


aag 


aca 


gac 


1645 


Gly 


Asp 


Ala 


Lys 


Thr 


Asp 




520 










525 




aag 


agg 


egg 


acc 


etc 


ctg 


1693 


Lys 


Arg 


Arg 


Thr 


Leu 


Leu 
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acg 


ctg 


ctg 


ggg 


gac 


ttc 


1741 


Thr 


Leu 


Leu 


Gly 


Asp 


Phe 










555 








ate 


gag 


egg 


gtg 


cac 


aag 


1789 


He 


Glu 


Arg 


Val 


His 


Lys 
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cag 


ctg 


ccc 


gac 


ctg 


atg 


1837 
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Pro 


Asp 
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Met 
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gtc 
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acc 


aac 
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tgt 
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Val 


Gly 


Thr 
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Pro 


Cys 




600 
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Phe 
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He 
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Arg 
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Arg 
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He 
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ttc 


gtg 


atg 


gac 


ate 


ctg 


2749 


Asn 


Arg 


Thr 


Leu 


He 


Gin Gly 


His 


Leu 


Asp 


Phe 


val 


Met 


Asp 


He 


Leu 








880 










885 










890 










gtg 


ttc 


cac 


tec 


tec 


cgc 


cag 


gat 


ggc 


etc 


aat 


gac 


tgt 


atg 


cac 


aac 


2797 


Val 


Phe 
895 


His 


Ser 


Ser 


Arg 


Gin 
900 


Asp 


Gly 


Leu 


Asn 


Asp 
905 


Cys 


Met 


His 


Asn 




aac 


ggg 


cag 


tgt 


ggg 


cag 


ctg 


tgc 


ctt 


gcc 


ate 


ccc 


ggc 


ggc 


cac 


cgc 


2845 


Asn 


Gly 


Gin 


Cys 


Gly Gin 


Leu 


Cys 


Leu 


Ala 


He 


Pro 


Gly 


Gly 


His 


Arg 




910 










915 










920 










925 




tgc 


ggc 


tgc 


gcc 


tea 


cac 


tac 


acc 


ctg 


gac 


ccc 


age 


age 


cgc 


aac 


tgc 


2893 


Cys 


Gly 


Cys 


Ala 


Ser 
930 


His 


Tyr 


Thr 


Leu 


Asp 
935 


Pro 


Ser 


Ser 


Arg 


Asn 
940 


Cys 




age 


ccg 


ccc 


acc 


acc 


ttc 


ttg 


ctg 


ttc 


age 


cag 


aaa 


tct 


gcc 


ate 


agt 


2941 


Ser 


Pro 


Pro 


Thr 
945 


Thr 


Phe 


Leu 


Leu 


Phe 
950 


Ser 


Gin 


Lys 


Ser 


Ala 
955 


He 


Ser 




egg 


atg 


ate 


ccg 


gac 


gac 


cag 


cac 


age 


ccg 


gat 


etc 


ate 


ctg 


ccc 


ctg 


2989 


Arg 


Met 


He 
960 


Pro 


Asp 


Asp 


Gin 


His 
965 


Ser 


Pro 


Asp 


Leu 


He 
970 


Leu 


Pro 


Leu 




cat 


gga 


ctg 


agg 


aac 


gtc 


aaa 


gcc 


ate 


gac 


tat 


gac 


cca 


ctg 


gac 


aag 


3037 


His 


Gly 
975 


Leu 


Arg 


Asn 


val 


Lys 
980 


Ala 


He 


Asp 


Tyr 


Asp 
985 


Pro 


Leu 


Asp 


Lys 
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ttc 


ate 


tac 


tgg 


gtg 


gat 


ggg 


cgc 


cag 


aac 


ate 


aag 


cga 


gee 


aag 


gac 


3085 


Phe 


lie 


Tyr 


Trp 


Val 


Asp 


Gly 


Arg 


Gin 


Asn 


He 


Lys 


Arg 


Ala 


Lys 


Asp 




990 










995 










1000 








1005 




gac 


ggg 


acc 


cag 


ccc 


ttt 


gtt 


ttg 


acc 


tct 


ctg 


age 


caa 


ggc 


caa 


aac 


3133 


Asp 


Gly 


Thr 


Gin 


Pro 


Phe 


Val 


Leu 


Thr 


Ser 


Leu 


Ser 


Gin 


Gly 


Gin 


Asn 








1010 








1015 








1020 




cca 


gac 


agg 


cag 


ccc 


cac 


gac 


etc 


age 


ate 


gac 


ate 


tac 


age 


egg 


aca 


3181 


Pro 


Asp 


Arg 


Gin 


Pro 


His 


Asp 


Leu 


Ser 


He 


Asp 


He 


Tyr 


Ser 


Arg 


Thr 








1025 








1030 








1035 






ctg 


ttc 


tgg 


acg 


tgc 


gag 


gec 


acc 


aat 


acc 


ate 


aac 


gtc 


cac 


agg 


ctg 


3229 


Leu 


Phe 


Trp 


Thr 


Cys 


Glu 


Ala 


Thr 


Asn 


Thr 


He 


Asn 


Val 


His 


Arg 


Leu 








1040 








1045 








1050 








age 


ggg 


gaa 


gee 


atg 


ggg 


gtg 


gtg 


ctg 


cgt 


ggg 


gac 


cgc 


gac 


aag 


ccc 


3277 


Ser 


Gly 


Glu 


Ala 


Met 


Gly 


Val 


Val 


Leu 


Arg 


Gly 


Asp 


Arg 


Asp 


Lys 


Pro 






1055 








1060 








1065 










agg 


gee 


ate 


gtc 


gtc 


aac 


gcg 


gag 


cga 


ggg 


tac 


ctg 


tac 


ttc 


acc 


aac 


3325 


Arg 


Ala 


He 


Val 


Val 


Asn 


Ala 


Glu 


Arg 


Gly 


Tyr 


Leu 


Tyr 


Phe 


Thr 


Asn 




1070 








1075 








1080 








1085 




atg 


cag 


gac 


egg 


gca 


gec 


aag 


ate 


gaa 


cgc 


gca 


gec 


ctg 


gac 


ggc 


acc 


3373 


Met 


Gin 


Asp 


Arg 


Ala 


Ala 


Lys 


He 


Glu 


Arg 


Ala 


Ala 


Leu 


Asp 


Gly 


Thr 












1090 








1095 








1100 




oaa 

y a y 


cgc 


gag 


gtc 


etc 


ttc 


acc 


acc 


ggc 


etc 


ate 


cgc 


cct 


gtg 


gee 


ctg 


3421 


Glu 


Arg 


Glu 


Val 


Leu 


Phe 


Thr 


Thr 


Gly 


Leu 


He 


Arg 


Pro 


Val 


Ala 


Leu 








1105 








1110 








1115 






gtg 


gtg 


gac 


aac 


aca 


ctg 


ggc 


aag 


ctg 


ttc 


tgg 


gtg 


gac 


gcg 


gac 


ctg 


3469 


Val 


Val 


Asp 


Asn 


Thr 


Leu 


Gly 


Lys 


Leu 


Phe 


Trp 


Val 


Asp 


Ala 


Asp 


Leu 








1120 








1125 








1130 








aag 


cgc 


att 


gag 


age 


tgt 


gac 


ctg 


tea 


ggg 


gec 


aac 


cgc 


ctg 


acc 


ctg 


3517 


Lys 


Arg 


He 


Glu 


Ser 


Cys 


Asp 


Leu 


Ser 


Gly 


Ala 


Asn 


Arg 


Leu 


Thr 


Leu 






1135 








1140 








1145 










gag 


gac 


gec 


aac 


ate 


gtg 


cag 


cct 


ctg 


ggc 


ctg 


acc 


ate 


ctt 


ggc 


aag 


3565 


Glu 


Asp 

) 


Ala 


Asn 


He 


Val 


Gin 


Pro 


Leu 


Gly 


Leu 


Thr 


He 


Leu 


Gly 


Lys 




115( 








1155 








1160 








1165 




cat 


etc 


tac 


tgg 


ate 


gac 


cgc 


cag 


cag 


cag 


atg 


ate 


gag 


cgt 


gtg 


gag 


3613 


His 


Leu 


Tyr 


Trp 


He 


Asp 


Arg 


Gin 


Gin 


Gin 


Met 


He 


Glu 


Arg 


Val 


Glu 












1170 








1175 








1180 




aag 


acc 


acc 


ggg 


gac 


aag 


egg 


act 


cgc 


ate 


cag 


ggc 


cgt 


gtc 


gee 


cac 


3661 


L Y S 


Thr 


Thr 


Gly 


Asp 


Lys 


Arg 


Thr 


Arg 


He 


Gin 


Gly 


Arg 


Val 


Ala 


His 








1185 








1190 








1195 






etc 


act 


ggc 


ate 


cat 


gca 


gtg 


gag 


gaa 


gtc 


age 


ctg 


gag 


gag 


ttc 


tea 


3709 


Leu 


Thr 


Gly 


He 


His 


Ala 


Val 


Glu 


Glu 


Val 


Ser 


Leu 


Glu 


Glu 


Phe 


Ser 








1200 








1205 








1210 








gec 


cac 


cca 


tgt 


gec 


cgt 


gac 


aat 


ggt 


ggc 


tgc 


tec 


cac 


ate 


tgt 


att 


3757 


Ala 


His 


Pro 


Cys 


Ala 


Arg 


Asp 


Asn Gly 


Gly 


Cys 


Ser 


His 


He 


Cys 


He 






1215 








1220 








1225 










gec 


aag 


ggt 


gat 


ggg 


aca 


cca 


egg 


tgc 


tea 


tgc 


cca 


gtc 


cac 


etc 


gtg 


3805 


Ala 


Lys 


Gly 


Asp 


Gly 


Thr 


Pro 


Arg 


Cys 


Ser 


Cys 


Pro 


Val 


His 


Leu 


Val 




1230 








1235 








1240 








1245 




etc 


ctg 


cag 


aac 


ctg 


ctg 


acc 


tgt 


gga 


gag 


ccg 


ccc 


acc 


tgc 


tec 


ccg 


3853 


Leu 


Leu 


Gin 


Asn 


Leu 


Leu 


Thr 


Cys 


Gly 


Glu 


Pro 


Pro 


Thr 


Cys 


Ser 


Pro 












1250 








1255 








1260 




gac 


cag 


ttt 


gca 


tgt 


gec 


aca 


ggg 


gag 


ate 


gac 


tgt 


ate 


ccc 


ggg 


gee 


3901 


Asp 


Gin 


Phe 


Ala 


Cys 


Ala Thr Gly Glu 


He 


Asp 


Cys 


He 


Pro 


Gly Ala 





1265 1270 1275 



-11- 



BNSDOCID: <WO 02092000A2_I_> 




WO 02/092000 PCT/US02/14877 



tgg 


cgc 


tgt 


gac 


ggc 


ttt 


ccc 


gag 


tgc 


gat 


gac 


cag 


age 


gac 


gag 


gag 


3949 


Trp 


Arg 


Cys 


Asp 


Gly Phe 


Pro 


Glu 


Cys 


Asp 


Asp 


Gin 


Ser 


Asp 


Glu 


Glu 








1280 








1285 








1290 








ggc 


tgc 


ccc 


gtg 


tgc 


tec 


gee 


gee 


cag 


ttc 


ccc 


tgc 


gcg 


egg 


ggt 


cag 


3997 


Gly Cys 


Pro 


Val 


Cys 


Ser 


Ala 


Ala 


Gin 


Phe 


Pro 


Cys 


Ala 


Arg 


Gly 


Gin 






1295 








1300 








1305 










tgt 


gtg 


gac 


ctg 


cgc 


ctg 


cgc 


tgc 


gac 


ggc 


gag 


gca 


gac 


tgt 


cag 


gac 


4045 


Cys 


Val 


Asp 


Leu 


Arg 


Leu 


Arg 


Cys 


Asp 


Gly Glu Ala 


Asp 


Cys 


Gin 


Asp 




1310 








1315 








1320 








1325 




cgc 


tea 


gac 


gag 


gtg gac 


tgt 


gac 


gec 


ate 


tgc 


ctg 


ccc 


aac 


cag 


ttc 


4093 


Arg 


Ser 


Asp 


Glu 


Val 


Asp 


Cys 


Asp 


Ala 


He 


Cys 


Leu 


Pro 


Asn 


Gin 


Phe 












1330 








1335 








1340 




egg 


tgt 


gcg 


age 


ggc 


cag 


tgt 


gtc 


etc 


ate 


aaa 


cag 


cag 


tgc 


gac 


tec 


4141 


Arg 


Cys 


Ala 


Ser 


Gly 


Gin 


Cys 


Val 


Leu 


He 


Lys 


Gin 


Gin 


Cys 


Asp 


Ser 










1345 








1350 








1355 






ttc 


ccc 


gac 


tgt 


ate 


gac 


ggc 


tec 


gac 


gag 


etc 


atg 


tgt 


gaa 


ate 


acc 


4189 


Phe 


Pro 


Asp 


Cys 


He 


Asp 


Gly 


Ser 


Asp 


Glu 


Leu 


Met 


Cys 


Glu 


He 


Thr 








1360 








1365 








1370 








aag 


ccg 


ccc 


tea 


gac 


gac 


age 


ccg 


gee 


cac 


age 


agt 


gee 


ate 


ggg 


ccc 


4237 


Lys 


Pro 


Pro 


Ser 


Asp 


Asp 


Ser 


Pro 


Ala 


His 


Ser 


Ser 


Ala 


He 


Gly 


Pro 






1375 








1380 








1385 










gtc 


att 


ggc 


ate 


ate 


etc 


tct 


etc 


ttc 


gtc 


atg 


ggt 


ggt 


gtc 


tat 


ttt 


4285 


Val 


He 


Gly 


He 


He 


Leu 


Ser 


Leu 


Phe 


Val 


Met 


Gly 


Gly 


Val 


Tyr 


Phe 




1390 








1395 








1400 








1405 




gtg 


tgc 


cag 


cgc 


gtg 


gtg 


tgc 


cag 


cgc 


tat 


gcg 


ggg 


gee 


aac 


ggg 


ccc 


4333 


Val 


Cys 


Gin 


Arg 


Val 


Val 


Cys 


Gin 


Arg 


Tyr 


Ala 


Gly 


Ala 


Asn 


Gly 


Pro 












1410 








1415 








1420 




ttc 


ccg 


cac 


gag 


tat 


gtc 


age 


ggg 


acc 


ccg 


cac 


gtg 


ccc 


etc 


aat 


ttc 


4381 


Phe 


Pro 


His 


Glu 


Tyr 


Val 


Ser 


Gly 


Thr 


Pro 


His 


Val 


Pro 


Leu 


Asn 


Phe 










1425 








1430 








1435 






ata 


gee 


ccg 


ggc 


ggt 


tec 


cag 


cat 


ggc 


ccc 


ttc 


aca 


ggc 


ate 


gca 


tgc 


4429 


He 


Ala 


Pro 


Gly 


Gly 


Ser 


Gin 


His 


Gly 


Pro 


Phe 


Thr 


Gly 


He 


Ala 


Cys 








1440 








1445 








1450 








gga 


aag 


tec 


atg 


atg 


age 


tec 


gtg 


age 


ctg 


atg 


ggg 


ggc 


egg 


ggc 


ggg 


4477 


Gly 


Lys 


Ser 


Met 


Met 


Ser 


Ser 


Val 


Ser 


Leu 


Met 


Gly 


Gly 


Arg 


Gly 


Gly 






1455 








1460 








1465 










gtg 


ccc 


etc 


tac 


gac 


egg 


aac 


cac 


gtc 


aca 


ggg 


gee 


teg 


tec 


age 


age 


4525 


val 


Pro 


Leu 


Tyr 


Asp 


Arg 


Asn 


His 


Val 


Thr 


Gly 


Ala 


Ser 


Ser 


Ser 


Ser 




1470 








1475 








1480 








1485 




teg 


tec 


age 


acg 


aag 


gee 


acg 


ctg 


tac 


ccg 


ccg 


ate 


ctg 


aac 


ccg 


ccg 


4573 


Ser 


Ser 


Ser 


Thr 


Lys 


Ala 


Thr 


Leu 


Tyr 


Pro 


Pro 


He 


Leu 


Asn 


Pro 


Pro 












1490 








1495 








1500 




ccc 


tec 


ccg 


gee 


acg 


gac 


ccc 


tec 


ctg 


tac 


aac 


atg 


gac 


atg 


ttc 


tac 


4621 


Pro 


Ser 


Pro 


Ala 


Thr Asp 


Pro 


Ser 


Leu 


Tyr 


Asn 


Met 


Asp 


Met 


Phe 


Tyr 










1505 








1510 








1515 






tct 


tea 


aac 


att 


ccg 


gee 


act 


gcg 


aga 


ccg 


tac 


agg 


ccc 


tac 


ate 


att 


4669 


Ser 


Ser 


Asn 


He 


Pro 


Ala 


Thr 


Ala 


Arg 


Pro 


Tyr 


Arg 


Pro 


Tyr 


He 


He 








1520 








1525 








1530 








cga 


gga 


atg 


gcg 


ccc 


ccg 


acg 


acg 


ccc 


tgc 


age 


acc 


gac 


gtg 


tgt 


gac 


4717 


Arg 


Gly 


Met 


Ala 


Pro 


Pro 


Thr 


Thr 


Pro 


Cys 


Ser 


Thr 


Asp 


Val 


Cys 


Asp 






1535 








1540 








1545 










age 


gac 


tac 


age 


gee 


age 


cgc 


tgg 


aag 


gee 


age 


aag 


tac 


tac 


ctg 


gat 


4765 


Ser 


Asp 


Tyr 


Ser 


Ala 


Ser 


Arg 


Trp 


Lys 


Ala 


Ser 


Lys 


Tyr 


Tyr 


Leu 


Asp 





1550 1555 1560 1565 
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ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4 813 

Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gee acc gag 4 861 

Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 4909 
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1^10 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965 
Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5 02 5 

taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 
tggagaaata tttataaact taattttgta aaaca 

<210> 3 
<211> 1615 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 

Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser Pro Asp Gly 

100 105 110 

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

130 135 140 

Leu Phe Trp Gin Asp Leu Asp Gin Pro Lys Ala lie Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg lie 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys He lie Val Asp Ser 

180 185 190 

Asp He Tyr Trp Pro Asn Gly Leu Thr He Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe He His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gin 
245 250 255 
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Thr 


Arg 


Ser 


He 
260 


His 


Ala 


Cys 


Asn 


Lys 
265 


Arg 


Thr 


Gly 


Gly 


Lys 
270 


Arg 


Lys 


Glu 


He 


Leu 
275 


Ser 


Ala 


Leu 


Tyr 


Ser 
280 


Pro 


Met 


Asp 


He 


Gin 
285 


Val 


Leu 


Ser 


Gin 


Glu 
290 


Arg 


Gin 


Pro 


Phe 


Phe 
295 


His 


Thr 


Arg 


Cys 


Glu 
300 


Glu 


Asp 


Asn 


Gly 


Gly 


Trp 


Ser 


His 


Leu 


Cys 


Leu 


Leu 


Ser 


Pro 


Ser 


Glu 


Pro 


Phe 


Tyr 


Thr 


305 










310 










315 










320 


Cys 


Ala 


Cys 


Pro 


Thr 
325 


Gly 


Val 


Gin 


Met 


Gin 
330 


Asp 


Asn 


Gly 


Arg 


Thr 
335 


Cys 


Lys 


Ala 


Gly Ala 


Glu 


Glu 


Val 


Leu 


Leu 


Leu 


Ala 


Arg 


Arg 


Thr 


Asp 


Leu 








340 










345 










350 






Arg 


Arg 


He 


Ser 


Leu Asp 


Thr 


Pro 


Asp 


Phe 


Thr 


Asp 


He 


Val 


Leu 


Gin 






355 










360 










365 








Val 


Asp 


Asp 


He Arg His 


Ala 


He 


Ala 


He 


Asp 


Tyr 


Asp 


Pro 


Leu 


Glu 




370 










375 










380 










Gly 


Tyr 


Val 


Tyr 


Trp 


Thr 


Asp 


Asp 


Glu 


Val 


Arg 


Ala 


He 


Arg 


Arg 


Ala 


385 










390 










395 










400 


Tyr 


Leu 


Asp 


Gly 


Ser 


Gly Ala 


Gin 


Thr 


Leu 


Val 


Asn 


Thr 


Glu 


He 


Asn 










405 










410 










415 




Asp 


Pro 


Asp 


Gly 
420 


He 


Ala 


Val 


Asp 


Trp 
425 


Val 


Ala 


Arg 


Asn 


Leu 
430 


Tyr 


Trp 


Thr 


Asp 


Thr Gly Thr Asp 


Arg 


He 


Glu 


Val 


Thr 


Arg 


Leu 


Asn 


Gly 


Thr 






435 










440 










445 








Ser 


Arg 
450 


Lys 


He 


Leu 


Val 


Ser 
455 


Glu 


Asp 


Leu 


Asp 


Glu 
460 


Pro 


Arg 


Ala 


He 


Ala 


Leu 


His 


Pro 


Val 


Met 


Gly 


Leu 


Met 


Tyr 


Trp 


Thr 


Asp 


Trp 


Gly 


Glu 


465 










470 










475 










480 


Asn 


Pro 


Lys 


He 


Glu 
485 


Cys 


Ala 


Asn 


Leu 


Asp 
490 


Gly 


Gin 


Glu 


Arg 


Arg 
495 


Val 


Leu 


Val 


Asn 


Ala 


Ser 


Leu 


Gly Trp 


Pro 


Asn 


Gly 


Leu 


Ala 


Leu 


Asp 


Leu 








500 










505 










510 






Gin 


Glu 


Gly 
515 


Lys 


Leu 


Tyr 


Trp 


Gly 
520 


Asp 


Ala 


Lys 


Thr 


Asp 
525 


Lys 


He 


Glu 


Val 


He 


Asn Val Asp Gly Thr Lys 


Arg Arg 


Thr 


Leu 


Leu 


Glu 


Asp 


Lys 




530 










535 










540 










Leu 


Pro 


His 


He 


Phe 


Gly 


Phe 


Thr 


Leu 


Leu 


Gly 


Asp 


Phe 


He 


Tyr 


Trp 


545 










550 










555 










560 


Thr 


Asp 


Trp 


Gin 


Arg 
565 


Arg 


Ser 


He 


Glu 


Arg 
570 


Val 


His 


Lys 


Val 


Lys 
575 


Ala 


Ser 


Arg 


Asp 


Val 
580 


He 


He 


Asp 


Gin 


Leu 
585 


Pro 


Asp 


Leu 


Met 


Gly 
590 


Leu 


Lys 


Ala 


Val 


Asn 


Val 


Ala 


Lys 


Val 


Val 


Gly Thr 


Asn 


Pro 


Cys 


Ala 


Asp 


Arg 






595 










600 










605 








Asn 


Gly 


Gly Cys 


Ser 


His 


Leu 


Cys 


Phe 


Phe 


Thr 


Pro 


His 


Ala 


Thr 


Arg 




610 










615 










620 










Cys 


Gly 


Cys 


Pro 


He 


Gly Leu Glu 


Leu 


Leu 


Ser 


Asp 


Met 


Lys 


Thr 


Cys 


625 










630 










635 










640 


lie 


Val 


Pro 


Glu 


Ala 
645 


Phe 


Leu 


Val 


Phe 


Thr 
650 


Ser 


Arg 


Ala 


Ala 


He 
655 


His 


Arg 


He 


Ser 


Leu 


Glu 


Thr 


Asn 


Asn 


Asn Asp 


Val 


Ala 


He 


Pro 


Leu 


Thr 








660 










665 










670 






Gly 


Val 


Lys 
675 


Glu 


Ala 


Ser 


Ala 


Leu 
680 


Asp 


Phe 


Asp 


Val 


Ser 
685 


Asn 


Asn 


His 
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He 


Tyr 


Trp 


Thr 


Asp 


Val 


Ser 


Leu 


Lys 


Asn 


He 


Ser 


Arg 


Ala 


Phe 


Met 




690 










695 










700 










Asn 


Gly 


Ser 


Ser 


Val 


Glu 


His 


Val 


Val 


Glu 


Phe 


Gly 


Leu 


Asp 


Tyr 


Pro 


705 








710 










715 










720 


Glu 


Gly 


Met 


Ala 


Val 


Asp 


Trp 


Met 


Gly 


Lys 


Asn 


Leu 


Tyr 


Trp 


Ala 


Asp 










725 










730 










735 




Thr 


Gly 


Thr 


Asn 


Arg 


He 


Glu 


Val 


Ala 


Arg 


Leu 


Asp 


Gly 


Gin 


Phe 


Arg 








740 










745 










750 






Gin 


Val 


Leu 


Val 


Trp 


Arg 


Asp 


Leu 


Asp 


Asn 


Pro 


Arg 


Ser 


Leu 


Ala 


Leu 






755 










760 










765 








Asp 


Pro 


Thr 


Lys 


Gly 


Tyr 


He 


Tyr 


Trp 


Thr 


Glu 


Trp 


Gly 


Gly Lys 


Pro 




770 










775 










780 










Arg 


He 


Val 


Arg 


Ala 


Phe 


Met 


Asp 


Gly 


Thr 


Asn 


Cys 


Met 


Thr 


Leu 


Val 


785 










790 










795 










800 


Asp 


Lys 


Val 


Gly 


Arg 


Ala 


Asn 


Asp 


Leu 


Thr 


He 


Asp 


Tyr 


Ala 


Asp 


Gin 










805 










810 










815 




Arg 


Leu 


Tyr 


Trp 


Thr 


Asp 


Leu 


Asp 


Thr 


Asn 


Met 


He 


Glu 


Ser 


Ser 


Asn 








820 










825 










830 






Met 


Leu 


Gly 


Gin 


Glu 


Arg 


Val 


Val 


He 


Ala 


Asp 


Asp 


Leu 


Pro 


His 


Pro 






835 










840 










845 








Phe 


Gly 


Leu 


Thr 


Gin 


Tyr 


Ser 


Asp 


Tyr 


He 


Tyr 


Trp 


Thr 


As P 


Trp 


Asn 




850 










855 










860 










Leu 


His 


Ser 


He 


Glu 


Arg 


Ala 


Asp 


Lys 


Thr 


Ser 


Gly 


Arg 


Asn 


Arg 


Thr 


865 










870 










875 










880 


Leu 


lie 


Gin 


Gly 


His 


Leu 


Asp 


Phe 


Val 


Met 


Asp 


He 


Leu 


Val 


Phe 


His 










885 










890 










895 




Ser 


Ser 


Arg 


Gin 


Asp 


Gly 


Leu 


Asn 


Asp 


Cys 


Met 


His 


Asn 


Asn 


Gly 


Gin 








900 










905 










910 






Cys 


Gly 


Gin 


Leu 


Cys 


Leu 


Ala 


He 


Pro 


Gly 


Gly 


His 


Arg 


Cys 


Gly 


Cys 






915 










920 










925 








Ala 


Ser 


His 


Tyr 


Thr 


Leu 


Asp 


Pro 


Ser 


Ser 


Arg 


Asn 


Cys 


Ser 


Pro 


Pro 




930 










935 










940 










Thr 


Thr 


Phe 


Leu 


Leu 


Phe 


Ser 


Gin 


Lys 


Ser 


Ala 


He 


Ser 


Arg 


Met 


He 


945 










950 










955 










960 


Pro 


Asp 


Asp 


Gin 


His 


Ser 


Pro 


Asp 


Leu 


He 


Leu 


Pro 


Leu 


His 


Gly 


Leu 










965 










970 










975 




Arg 


Asn 


Val 


Lys 


Ala 


He 


Asp 


Tyr 


Asp 


Pro 


Leu 


Asp 


Lys 


Phe 


He 


Tyr 








980 










985 










990 






Trp 


Val 


Asp 


Gly 


Arg 


Gin 


Asn 


He 


Lys 


Arg 


Ala 


Lys 


Asp 


Asp 


Gly 


Thr 






995 










1000 








1005 






Gin 


Pro 


Phe 


Val 


Leu 


Thr 


Ser 


Leu 


Ser 


Gin 


Gly 


Gin 


Asn 


Pro 


Asp 


Arg 




1010 








1015 








1020 








Gin 


Pro 


His 


Asp 


Leu 


Ser 


He Asp 


He 


Tyr 


Ser 


Arg 


Thr 


Leu 


Phe 


Trp 



1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser Gly Glu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala He 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 
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Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg lie 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val He Gly 

1380 1385 1390 

He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 
1540 1545 ^ 1550 
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Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 " 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 " 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 4 

<211> 1615 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 

Asp Val Ser Glu Glu Ala He Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val He Ser Gly Leu Val Ser Pro Asp Gly 

100 105 HO 

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg He Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

13 0 135 14 0 

Leu Phe Trp Gin Asp Leu Asp Gin Pro Lys Ala He Ala Leu Asp Pro 
145 ~ 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg He 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys He He Val Asp Ser 

180 185 190 

Asp He Tyr Trp Pro Asn Gly Leu Thr He Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe He His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gin 

245 250 255 

Thr Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 

260 265 270 

Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin Val Leu Ser 

275 280 285 

Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 
290 295 300 
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Gly 


Trp 


Ser 


His 


305 








Cys 


Ala 


Cys 


Pro 


Lys 


Ala 


Gly 


Ala 








340 


Aror 


Arg 


He 


Ser 






355 




Val 


Asp 


Asp 


He 




370 






Gly 


Tyr 


Val 


Tyr 


385 








Tyr 


Leu 


Asp 


Gly 


Asp 


Pro 


Asp 


Gly 








420 


Thr 


Asp 


Thr 


Gly 






435 




Ser 


Arg 


Lys 


He 




450 






Ala 


Leu 


His 


Pro 


465 








Asn 


Pro 


Lys 


He 


Leu 


Val 


Asn 


Ala 








500 


Gin 


Glu 


Gly 


Lys 






515 




Val 


He 


Asn 


Val 




530 






Leu 


Pro 


His 


He 


545 








Thr 


Asp 


Trp 


Gin 


Ser 


Arg 


Asp 


Val 








580 




Via 1 




V Ct X 






595 




Asn 


Gly 


Gly 


Cys 




610 






Cys 


Gly 


Cys 


Pro 


625 








lie 


Val 


Pro 


Glu 


Arg 


He 


Ser 


Leu 








660 


Gly 


Val 


Lys 


Glu 






675 




lie 


Tyr 


Trp 


Thr 




690 






Asn 


Gly 


Ser 


Ser 


705 








Glu 


Gly 


Met 


Ala 



Leu Cys Leu Leu 
310 

Thr Gly Val Gin 
325 

Glu Glu Val Leu 

Leu Asp Thr Pro 
360 

Arg His Ala He 
375 

Trp Thr Asp Asp 
390 

Ser Gly Ala Gin 
405 

He Ala Val Asp 

Thr Asp Arg He 
440 

Leu Val Ser Glu 
455 

Val Met Gly Leu 
470 

Glu Cys Ala Asn 
485 

Ser Leu Gly Trp 

Leu Tyr Trp Gly 
520 

Asp Gly Thr Lys 
535 

Phe Gly Phe Thr 
550 

Arg Arg Ser He 
565 

He He Asp Gin 

Ala Lys Val Val 
600 

Ser His Leu Cys 
615 

He Gly Leu Glu 
630 

Ala Phe Leu Val 
645 

Glu Thr Asn Asn 

Ala Ser Ala Leu 
680 

Asp Val Ser Leu 
695 

Val Glu His Val 
710 

Val Asp Trp Met 
725 



Ser Pro Ser Glu 
315 

Met Gin Asp Asn 
330 

Leu Leu Ala Arg 
345 

Asp Phe Thr Asp 

Ala He Asp Tyr 
380 

Glu Val Arg Ala 
395 

Thr Leu Val Asn 
410 

Trp Val Ala Arg 
425 

Glu Val Thr Arg 

Asp Leu Asp Glu 
460 

Met Tyr Trp Thr 
475 

Leu Asp Gly Gin 
490 

Pro Asn Gly Leu 
505 

Asp Ala Lys Thr 

Arg Arg Thr Leu 
540 

Leu Leu Gly Asp 
555 

Glu Arg Val His 
570 

Leu Pro Asp Leu 
585 

Gly Thr Asn Pro 

Phe Phe Thr Pro 
620 

Leu Leu Ser Asp 
635 

Phe Thr Ser Arg 
650 

Asn Asp Val Ala 
665 

Asp Phe Asp Val 

Lys Asn He Ser 
700 

Val Glu Phe Gly 
715 

Gly Lys Asn Leu 
730 



Pro 


Phe 


Tyr 


Thr 








320 


Glv 




Thr 


Cys 






335 




Arg 


Thr 


Asp 


Leu 




350 






He 


Val 


Leu 


Gin 


365 








Asp 


Pro 


Leu 


Glu 


He 


Arg 


Arg 


Ala 








400 


Thr 


Glu 


He 


Asn 






415 




Asn 


Leu 


Tyr 


Trp 




430 






Leu 


Asn 


Gly 


Thr 


445 








Pro 


Arg 


Ala 


He 


Asp 


Trp 


Gly 


Glu 








480 


Glu 




Arg 


Val 






495 




Ala 


Leu 


Asp 


Leu 




510 






Asp 


Lys 


He 


Glu 


525 








Leu 


Glu 


Asp 


Lys 


Phe 


He 


Tyr 


Trp 








560 


Lys 


Val 


Lys 


Ala 






575 




Met 


Gly 


Leu 


Lys 




590 






Cys 


Ala 


Asp 


Arg 


605 








His 


Ala 


Thr 


Arg 


Met 


Lys 


Thr 


Cys 








640 


Ala 


Ala 


He 


His 






655 




He 


Pro 


Leu 


Thr 




670 






Ser 


Asn 


Asn 


His 


685 








Arg 


Ala 


Phe 


Met 


Leu 


Asp 


Tyr 


Pro 








720 


Tyr 


Trp 


Ala 


Asp 






735 
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Thr 


Gly 


Thr 


Asn 


Arg 


He 


Glu 


Val 


Ala 


Arg 


Leu 


Asp Gly 


Gin 


Phe 


Arg 






740 










745 










750 






Gin 


Val 


Leu 
755 


Val 


Trp 


Arg 


Asp 


Leu 
760 


Asp 


Asn 


Pro 


Arg 


Ser 
765 


Leu 


Ala 


Leu 


Asp 


Pro 
770 


Thr 


Lys 


Gly 


Tyr 


He 
775 


Tyr 


Trp 


Thr 


Glu 


Trp 
780 


Gly 


Gly 


Lys 


Pro 


Arg 


He 


Val 


Arg 


Ala 


Phe 


Met 


Asp 


Gly 


Thr 


Asn 


Cys 


Met 


Thr 


Leu 


Val 


785 










790 










795 










800 


Asp 


Lys 


Val 


Gly 


Arg 
805 


Ala 


Asn 


Asp 


Leu 


Thr 
810 


He 


Asp 


Tyr 


Ala 


Asp 
815 


Gin 


Arg 


Leu 


Tyr 


Trp 
820 


Thr 


Asp 


Leu 


Asp 


Thr 
825 


Asn 


Met 


He 


Glu 


Ser 
830 


Ser 


Asn 


Met 


Leu 


Gly 
835 


Gin 


Glu 


Arg 


Val 


Val 
840 


He 


Ala 


Asp 


Asp 


Leu 
845 


Pro 


His 


Pro 


Phe 


Gly 
850 


Leu 


Thr 


Gin 


Tyr 


Ser 
855 


Asp 


Tyr 


He 


Tyr 


Trp 
860 


Thr 


Asp 


Trp 


Asn 


Leu 


His 


Ser 


He 


Glu 


Arg 


Ala 


Asp 


Lys 


Thr 


Ser 


Gly Arg 


Asn 


Arg 


Thr 


865 










870 










875 










880 


Leu 


He 


Gin 


Gly 


His 


Leu 


Asp 


Phe 


Val 


Met 


Asp 


He 


Leu 


Val 


Phe 


His 








885 










890 










895 




Ser 


Ser 


Arg 


Gin 


Asp 


Gly 


Leu 


Asn 


Asp 


Cys 


Met 


His 


Asn 


Asn 


Gly Gin 








900 










905 










910 






Cys 


Gly 


Gin 


Leu 


Cys 


Leu 


Ala 


He 


Pro 


Gly 


Gly 


His 


Arg 


Cys 


Gly 


Cys 




915 










920 










925 








Ala 


Ser 
930 


His 


Tyr 


Thr 


Leu 


Asp 
935 


Pro 


Ser 


Ser 


Arg 


Asn 
940 


Cys 


Ser 


Pro 


Pro 


Thr 


Thr 


Phe 


Leu 


Leu 


Phe 


Ser 


Gin 


Lys 


Ser 


Ala 


He 


Ser 


Arg 


Met 


He 


945 










950 










955 










960 


Pro 


Asp 


Asp 


Gin 


His 


Ser 


Pro 


Asp 


Leu 


He 


Leu 


Pro 


Leu 


His 


Gly Leu 








965 










970 










975 




Arg 


Asn 


Val 


Lys 


Ala 


He 


Asp 


Tyr 


Asp 


Pro 


Leu 


Asp 


Lys 


Phe 


He 


Tyr 






980 










985 










990 






Trp 


Val 


Asp 


Gly 


Arg 


Gin 


Asn 


He 


Lys 


Arg 


Ala 


Lys 


Asp 


Asp 


Gly Thr 






995 










1000 








1005 






Gin 


Pro 


Phe 


Val 


Leu 


Thr 


Ser 


Leu 


Ser 


Gin 


Gly 


Gin 


Asn 


Pro 


Asp 


Arg 




1010 








1015 








1020 








Gin 


Pro 


His 


Asp 


Leu 


Ser 


He 


Asp 


He 


Tyr 


Ser 


Arg 


Thr 


Leu 


Phe 


Trp 


1025 






1030 








1035 








104' 


Thr 


Cys 


Glu 


Ala 


Thr 


Asn 


Thr 


He 


Asn 


Val 


His 


Arg 


Leu 


Ser 


Gly 


Glu 








1045 








1050 








1055 


Ala 


Met 


Gly 


Val 


Val 


Leu 


Arg 


Gly Asp 


Arg Asp 


Lys 


Pro 


Arg 


Ala 


He 








1060 








1065 








1070 




Val 


Val 


Asn 


Ala Glu Arg 


Gly 


Tyr 


Leu 


Tyr 


Phe 


Thr 


Asn 


Met 


Gin 


Asp 



1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 
1090 1095 1100 



Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 
1155 H60 1165 
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Trp lie Asp Arg Gin Gin Gin Met lie Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg lie Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

lie His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys lie Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val He Gly 

1380 1385 1390 

He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 
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Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 5 

<211> 3096 

<212> DNA 

<213> Homo sapiens 



<400> 5 

catcttctca 

tcagacgacg 

cagctccgtc 

actctgctaa 

cctctcccca 

ggggctgtat 

ccacaggccc 

ggaagggcag 

cccagttcct 

cttaaactta 

tcctggaaag 

tggggcctgc 

cagcaaatta 

gtgtaaaccg 

cttttgtacc 

ccgctgggac 

gcccctaagg 

cccatgggag 

tgcctggtga 

gtgggggctc 

gagtagaaag 

cggagctggg 

ttgtgtgtgt 

gcctgtgacc 

tgggacaagc 

attcacaaaa 

acaccgtgtg 

cacctgcttg 

cccctcctgc 

atcctgggcg 

gtgcacctgt 

cctgggggca 

ttggggtgcc 

gagcagctgc 

gggtgcatcg 

atagttgtcg 

gcaggggatg 

caatgtccag 

tttttctttt 

tcagctcact 

ctagcagggg 

gcaaggtttc 

gcctcggcct 

ctttaattcc 



cacgatctct 
gtccttcttc 
ccctaatatt 
tctgctgttc 
ccaaaggctg 
ggccttggtg 
tggtagggag 
gggccaccca 
tccaaaggaa 
gggaattgag 
agcctgcctg 
cgctcctgcc 
aaaaacacct 
tggtgaaatg 
cagggtttcc 
ctccagctgt 
gcaggtatgc 
gcctgttgtc 
ggtgcaggcc 
gctgcctccc 
ttccgtgtcc 
tccaccctgg 

gtgtgtttgt 

ccaagaacca 
tgggggccct 
tgggagtata 
ggggctgcct 
tgccaccagt 
cctcaggcct 
tgattcctgt 
tggccagacc 
gggacctggg 
tgcaggctgg 
cttcccccgc 
cccgcaggct 
cctttgaccc 
agatctgaag 
ttccgctgca 
tgagatggag 
gcaacctcca 
ttacaggcgt 
gtcatgttgg 
cccaaagtgc 
cacctgaaac 



cgcttcgcac 
gatctttgca 
catgcgcgga 
tctctccctc 
ggttttcttt 
cgaggccctc 
gagggggagc 
cacagagctc 
agccagctcc 
tgtgtggttg 
ccacagcggg 
cggctctcct 
ccatctctgg 
tggtctttcc 
aagagctcct 
aatagggaag 
ctgcctttac 
tgtgggafccc 
tggcaggccc 
gagacaatct 
agtcccactc 
tagggggact 
gtgtgtgtgt 
aggaatttca 
cgccacagga 
agtgatccct 
ggcctgggtg 
gccctctggg 
cagtgctgct 
gtgcctgcag 
tcctggtagc 
ctcctccctc 
gtggtgcagg 
ccgagggact 
gggtaggggt 
tgagttttct 
ataaatgcct 
gttataacat 
tctcgctctg 
cttctcgggt 
gagccaccac 
ccaggctggt 
tgagatgaca 
ttgccgcctt 



tccttccttt 
cattcttcta 
tggcccattc 
ccgtcgggtt 
gtcagggctc 
cagtgacagg 
agaagaggag 
tcccatttgc 
ccaggtggtg 
cttctaagtg 
ccttaccctg 
ggcctccccc 
cctttgaaga 
agtttgggga 
gcctccctcg 
gttttactgg 
ccgggttccc 
tccagcatca 
ggtcccccac 
atgtcattgt 
tgtgcgtgtg 
tcatggggcc 
gtctgaggag 
gcctgggtgg 
gcagccaggg 
gctctggcgg 
tgccgcgggt 
tctcacacac 
tccgtctcgg 
accccctcgc 
gggtgctgca 
actgagtgga 
tgaaatgcag 
tcacccgcag 
ggagcctggg 
tcgttatctg 
tagctgtgac 
cccatttttt 
tcacccaggc 
tcaagtgatt 
gcccagctaa 
ctcaaactcc 
ggtgtgagcc 
aagcaggtcc 



gattggtttt 
tcatctacta 
cgtggtgaaa 
ctgctcctgc 
ctttcccctt 
atcccccatc 
gtgccatctt 
tgtggaccct 
ggagagtgat 
ccttagaagc 
gctgtgccca 
ggtgtgggtt 
atgcatctga 
gaagcagggc 
gctgggctgg 
gttgctggcc 
ctcctgcctg 
gagacactgg 
ctgcttgagc 
tgtccaagga 
ttagcagggg 
tgggcgacag 
gtggaccagt 
atcacacctt 
catggggcac 
ccaggacgat 
gtccttgttg 
aactctcttc 
aaggccccag 
ggctgccatc 
ctcccctgaa 
gggaactcag 
acctctcagc 
cccagtcagg 
tggccctgcc 
tttggacctg 
catctccttt 
gatttctttt 
tggagtgcaa 
ctcctgcctc 
tttttgtatt 
tggccttaag 
accgtgcccg 
ccagtctccc 



caccatttac 
ccttcatacc 
attcccttct 
cacgttctcc 
tggaagaagg 
acccagagtt 
tgcctgctgg 
ggggccactg 
atggcttcct 
cgggagcggc 
cagatgtccc 
gggaaaagca 
acagccgaga 
agagctgggg 
ccagggcccc 
actgtggact 
gaagatacag 
ggccagcgtc 
acccacggtg 
agctaattta 
actctcgggc 
cactgtgtat 
ttctcaaaag 
cactggtgag 
agttggcctc 
gagtgggaac 
gtgatggttc 
ccagcgaagg 
gagctcctgc 
tcatcctttg 
tgtgccgggg 
tgtcttggag 
tggtgttcca 
ggtggcgcct 
tgtgagctgc 
tttggggcag 
tgtgagaggt 
tattttttcc 
tggggtgacc 
agcctcctga 
tttagtagag 
tgatctgccc 
gcccagaact 
tcccctagtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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cctggtccca ccattctgct 
aaattgaatc ataaagtatt 
cttaagtttc atccatgttg 
tttttttttt ttttgcagac 
gatctcggtt cactgcaaca 
cttagcagct ggaactacag 
tggaggcagg gtttcaccat 
gggcccgaag gacccgggca 

<210> 6 
<211> 26928 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 

<222> (12044) , (12489) , (26433) , (26434) , (26435) , (26436) , (26439) , (26441) 
<223> Identity of nucleotide sequences at the above locations are unknown* 

<400> 6 

gaagaccaag ggcacacagc gaggcagttt cagggcgggc agcctggggc cccacggggc 60 

ggccccggac acttgttctc acctgtggag ggcagagaag ggaacaggga gagaagtggc 12 0 

cggctgggag tggaggtggg tttgaggttt tactgtaaac taaatgtgta ccctctacct 180 

tagttatgaa ttatgagaca cgaagactgc gaaacagaca cactcctcta aaagtgcctc 240 

taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 3 00 

agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 3 60 

gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 420 

tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480 

ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 54 0 

agcaatggcc accttgcagg ttcagagagg gcttgcagta cctcacagaa ctgagtgccc 600 

gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660 

gcctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 720 

ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 78 0 

tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 840 

tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900 

<?gggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960 

gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 102 0 

cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 10 80 

cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 1140 

tctattttca agatagaata gacccagact cccagctctt cttacacttc tactactgcc 12 00 

acctgtcacc cactcatgcg tccccacttg cagcctcgac ccccttccac ctgatctcat 1260 

ggcagccagg gaagctccag ggctcgtgag ggctgccatc tcaggaaaga agcaaaagcc 1320 

ttcggcacct gcagggcctg ctccaaccac acttcttcct tgacctctca gcttccttag 13 80 

ccactccctt cccacatctc accctgctcc agccacagtg gtgtctctgt gggttctcaa 1440 

acacaccagg tgcactcctg cctcagggcc tttgtgcttg ctgttctctg ctgggactct 1500 

tttttttttt tttttttttg agacagggtc tcactctgtg gcccaggctg gagtgtagtg 1560 

gtgtgatcgt agctcattgc aacctcaaac tcctgggctc aagcaatcct cccacctcag 1620 

cctctcaagt agttagcttt tgttgttttg ttttgagatg ggatctcact ctgttgccca 1680 

ggctggagtg cagtggggca atcttggctc accacaacct ctgcctccca ggctcaagca 1740 

attctcctgc ctcagcctcc caagtagctg ggattacagg catgtgccac cacgcccagc 1800 

ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860 

tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 192 0 

gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 19 80 

ttttctgctc caactctttc accccactta gcctcgtggc tcactctctt acctcttcag 2040 



ttctgtctca atgaatttgc ctaccgtaag tacctcatat 2700 

tgtcttttta tatctggctt atttcactta gcataacatt 2760 

tagcatgtgt cagaatctct ctcttttttt tttttttttt 2820 

agagtctcgc tctgtcatct agactggagt tcagtggcac 2 880 

tctgcctcct gggtccaagc aattctcctg cctcagcctc 2940 

gcgcgtgcca ccatgccttg ctaatttttg tatttttatg 3 000 

cttggccagg ctggtctcga attcctggtc ttcaccacgg 3060 

aagcgtggag gggagg 3 096 
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ctcctcagtc 
cacccactat 
catcaccacc 
tccaggagag 
cctagcacac 
ctgatctaca 
gaaagaggcc 
agggcctctt 
gaagggggcc 
caggtttcta 
attgaaatga 
aaggggccac 
acctataatc 
aaccagccta 
ggcatggcag 
agaacccagg 
cgacagagca 
gcctgtaatc 
gaccatcctg 
gcgtggtggc 
gaacctggga 
gggcgacaga 

tgggcttggt 

cttgaacccg 
ggcgacagag 
gcctgggacc 
tgagaacatt 
ccagcacccc 
tgtgctgagc 
tttcttaggc 
tcccaatgaa 
cccatgagga 
ccaggctgga 
ggcaggactg 
ctgagggcag 
gctgccgcag 
gccagggaag 
gccatggttt 
gcctcagttt 
agtgctgatg 
tttcatttat 
ttacggctca 
aagtagctgg 
ggtttcactc 
aacctccctg 
tatgtgccag 
ctggtcttga 
ttataggtat 
tagcacttta 
tgtgttgccc 
caaagtgttg 
tgttcttgtt 
tcccgggttc 
gccaccacac 



aggcctgagg 

tgccagcact 

tgactcatta 

cagagagtgt 

agtaggtgct 

tttataaaga 

cctgtggggg 

ggactcaagt 

caggatggcc 

aatttgggaa 

ttacatagaa 

cagagactcc 

ccagcacttt 

ggcaacatgg 

catgfcgcctg 

aggcagaggt 

agactccgtc 

ccagcacttt 

gctaacatgg 

gggcacctgt 

ggcggagctt 

gttagactcc 

ggcgggcgcc 

ggaggcagag 

caagactcca 

caaagcacac 

catttgcaaa 

aaggacagtc 

ttccacttcc 

ctcaaatttg 

gtaacagggg 

tgtcgtttca 

acgtgctggc 

actgctgtgg 

cgccggagga 

gcctctcccc 

gccgccatcg 

ctatgagcca 

ctgcattcaa 

tgaacctcca 

tttaagataa 

ctacagcctt 

aactacaggc 

tgttgcccag 

ggcttaggtg 

tacacccagc 

actccagggt 

gagccacccc 

aaaaattttt 

aggctggtct 

ggattataag 

gctcaggctg 

aagcaattct 

ccagctaatt 



acccctgttg 

ttctactcca 

catgtttacg 

agttttattc 

cagtaaatgt 

aggcactctg 

cctggccagc 

ggatggggcc 

tgtcatagag 

gggatacaga 

aatgtaccaa 

cagagaggaa 

gggaggccga 

taaaaccccc 

taatcccagc 

tgcagtgagc 

tctaaaaaaa 

gggaggccga 

tgaaactcca 

agtcccagct 

gcagtgagcc 

gtctcaaaaa 

tgtaatccca 

gttgcaatga 

tctcaaaaaa 

tactgcaagg 

gaatgaatta 

atttttagac 

ccagacggtt 

gaaatgttca 

ctcagcacct 

gcccggtgac 

tctgtgaagt 

tcagaggcgc 

ggccccagtg 

aaaacaggca 

gcctacagtg 

gctggggtgt 

aaagtgggaa 

ataagacttt 

tgtctagctg 

aacctcccag 

atgcaccacc 

gctggagtgc 

atcctcacac 

taatattttt 

ttaagcaatc 

ccggcctata 

ctaaattttt 

tgaactccta 

cataagcctt 

gagtgcaatg 

cctgcctcag 

ttgtattttt 



aaaattgcaa 

tttctctgct 

tatctttctt 

cctgatgtgt 

gtgttggatg 

gctgctgagt 

aggtgggtac 

tgctcagggc 

gacacattac 

aaaactaaag 

gaattaaaaa 

agggactatg 

ggagggtgga 

gtttctacta 

tactcgggag 

cgagattgag 

agaaaaaaaa 

ggtgggtgga 

tctctactta 

acttgggagg 

gagattgcgc 

aaaaaaaaaa 

actactcggg 

gccgatatca 

gaaaaaaaaa 

tcccagggtg 

aaattcagca 

ccttcagtaa 

gcctgtcaca 

ctctcagttc 

gtccaatctc 

acttgggcaa 

tacgctgggc 

ccctggggct 

agagtgccca 

atgcttctca 

ctgggctctg 

gaaggacaca 

tcatgatatc 

taaaaatact 

tctcacccag 

gctcaagtga 

gcacctggat 

agtggtgcac 

ttcagtctcc 

gaaggatggg 

taccttcctc 

atcctaccac 

tatagagatg 

ggatcaagcc 

accttacctt 

gcaagatctt 

cctcccgagt 

agtagagatg 



accacacccc 

ttacttttct 

ctctccacta 

ttcctgtgcc 

aacaaataca 

ggggatgaga 

aatggtagca 

tccggccaca 

aactggccca 

actctactgg 

aaaaaaaaaa 

ggctggatgc 

tcacgaggtc 

aaaatacaaa 

gctgaggcag 

ccactatgct 

ggccagatga 

tcacgaggtc 

aaatacaaaa 

ctgaggcagg 

cactgcactc 

aaaaaaatta 

aggctgaggc 

cgccactaca 

aagaaagggg 

cctgactcca 

ctattttatt 

cgtaataagt 

gctcatcagg 

gttccttaga 

attgcttccg 

agtgcctttt 

acaagagccc 

ttgggagcca 

ctgccaggct 

gcaacctgcc 

gagggcttgg 

ggctggattc 

tgctctattt 

ctttctacct 

gctggagtgc 

tcctcctacc 

aattttttct 

tcttggctca 

caagtagctg 

gtttcactat 

agcctgccaa 

tttaaaaaag 

ggggacagct 

atcctcctgg 

ttttttttga 

ggctcactgc 

agctgggatt 

gggtttctct 



ccaccaccac 

cctttgtact 

gcatggaagc 

cgtaccaggg 

gtgaaaggat 

ctgtcaggag 

gccaggagag 

ggaacaaagg 

aagttcaagt 

acagtcagtt 

aagcattatg 

ggtgactcac 

aggagttcaa 

aaattagctg 

gagagttgct 

ccagcttggg 

ggtggctcat 

aggagatcga 

aattagccgg 

agaatggcgt 

catccagcct 

gctgattagt 

gggagaatca 

ctccagcctg 

ctgtgctgtg 

accggagcct 

ctgcaggatt 

aaccggagga 

ccaacaaact 

tgcaagtcca 

gggacagggg 

ggtttccctc 

cccccaaccc 

cagaatcttc 

cattcctcag 

ccaggagcag 

ttggtaacag 

acctctctgg 

cttatctctc 

tacttttatt 

agtggtgtga 

acagcctccc 

tttgagacaa 

ctgcagcctc 

ggactacagg 

attgcccagg 

agtgctagga 

cctgtaattt 

gtggtctcac 

cctggcctcc 

gttgcagttt 

aacctccacc 

acaggcatgc 

atatacctta 



2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 
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attttaaagc 
gatgaatgag 
gtcatcgtcc 
cataacttgc 
tctgcagtga 
cagatgaagg 
cagtggattg 
gaatgttaac 
atgaggagct 
gaaaagaaaa 
gccgaggtgg 
aatcccatct 
ccagctactc 
tgagccaaga 
aaaagaaaag 
catgtgaaga 
cccagaagtt 
ctgccaacac 
tgttttaagc 
acctaatctt 
tgtgcagtgt 
fcggctcacgc 
aggagttcaa 
attagccagg 
gaatcacttg 
agcctgggag 
agactgggta 
catggtgccg 
aggcagagcc 
gggaggtgtg 
caccaagcca 
cattgggaat 
cacccctggc 
cagtagcccc 
ccaacgtctc 
tatttactgc 
ggagaaatcg 
caatcattca 
gcacactggt 
cagaattcag 
ggtgcagggt 
tcccgcagct 
atttcccctc 
tttctcatac 
ctgcacacct 
tgaagcagca 
ctagatgcag 
gacaatgaaa 
atctctgaaa 
attggcagtc 
ggcttgaact 
ttttccaaac 
atttgctcct 
tgctgcttca 



actgcattca 
ccaggcagtc 
atccgtttct 
ctcctgggaa 
aaaataaaac 
agttgacatt 
aacagtggcc 
caagttagga 
catcgtggat 
gcagaggctg 
gtggatcacc 
ctactaaaaa 
aggaagctga 
tcgcgccact 
aaaagcagac 
tgaggcagag 
gcaagaggta 
ctccacctca 
caccagattt 
cacaaaccca 
gttaggccat 
ctgtaatccc 
gaccagcctg 
catggtggcg 
cgcccgggag 
acaagagaga 
gtttataaag 
gcatctgctt 
agagcaggtg 
cacactttta 
cgaggggtct 
tacagttcaa 
cccccagatc 
ccaaagtctt 
atctgaagat 
tgagatacaa 
gtcaaaagaa 
atcttaaagc 
gcaaggggca 
tccccatggc 
gcaagctgct 
ccactaggca 
caatgggaag 
atcctctgac 
acaggcttaa 
gcctgagctg 
gcagggagca 
ccattctttc 
tgcctttgag 
ggctcctttt 
cctctcctga 
ttttatgctc 
gcatttgagc 
aaatttatgg 



tgtaaattgt 
acctcggctg 
gcagctgttt 
agaatgccct 
taagtagtct 
caggcttcat 
ttcaaaaaga 
aaagggtctt 
tatccaggtg 
ggcacagtgg 
taaggtcatg 
tacaaaaatt 
ggcaggagaa 
gcactccagc 
agagatctga 
gttggagcca 
ggaagaagcc 
gacttctggc 
gtggtaattt 
tcttacagaa 
tcttgtatta 
agcactttgg 
gacaacatgg 
tgcatctgta 
gaggaggttg 
aaccctgtct 
aaaggggtta 
ggttgctggg 
catcacacag 
aatgagcaga 
gcccccatga 
catgaggttt 
tcatgttctt 
aactcatccc 
gagttccttt 
tgggggcata 
aggggctata 
tccaaaataa 
ggctcccaag 
tgctcttaca 
ggtgatctac 
ttgtcccagt 
gctctgcccc 
atctaggtgg 
caccacatgg 
tgactatggc 
gtgtcctgag 
ctcctcatcc 
gcctttttgc 
tagttatgca 
aaaagctttt 
tgttttacct 
atagggaatt 
ccacgcttgg 



gattaacatg 
tcaccctcca 
gtttgactct 
gggcaggccc 
ggtccacaga 
tctcatttat 
tgtatctaca 
cccgggtgtc 
gaccctgcat 
ctcaagcctg 
agttcgagag 
agcaaggcat 
tggcttgcac 
ctgagggaga 
gacagaagag 
tgcagccaca 
tcccctagag 
ctccagcact 
gttatggcag 
aaggaaactg 
ctataaagaa 
gaggccaagg 
tgaaacccca 
gtcccagcta 
tagtgagcca 
caaaataaat 
actggctccc 
aaggcttcag 
caaaagcagg 
tctcacgaga 
ccca.aa.cctc 

ggggggacaa 

ctcacattgc 
agcattaact 
cacctacaag 
ggcattaggt 
ggccccaagc 
cctccttaaa 
gccttgggca 
gattggagat 
cattctgggg 
ggggactcta 
tgcagcagcc 
atggtgtcaa 
aagctgccaa 
cctttgagcc 
gctgtgcaga 
tctgggcctg 
ctctgaggcc 
aatcctctag 
tctttctttg 
ttaaatataa 
caaagaagct 
tggctcacac 



gattcaagag 
cttctctcct 
cctgatcatt 
acatgagtag 
gcagtcttat 
aagtgtttta 
tcctaatccc 
attaagttag 
ccaaggacaa 
taatcccagc 
cagcctggcc 
ggtggcgggt 
ctgggaggcg 
aaagtgaaac 
gagagtgaag 
agccaaggaa 
cctccagacg 
gtgagataat 
ccacaggaaa 
aagtcagaga 
atacctgagg 
tgagtggatc 
tttctactga 
ctcaggaggc 
agattgtgcc 
aaaaaacaaa 
ggttctgcag 
ggagttttac 
agcgagagag 
actcaccatt 
ccactaggcc 
atatccaaac 
aaaatatagt 
caaaaatccc 
actgtaaaaa 
aaacattcct 
aagtccaaaa 
ctccatgtcc 
gctctattcc 
gagggcctgc 
tgtggatggt 
tgtggggcct 
ttcttcctgg 
gcttccttca 
ggtgtatggc 
aaggctggag 
gcagcagggc 
tgatgggagg 
tatttcctat 
caagaggtta 
tcacatggcc 
cttctaactt 
gggccacatc 
ctgtaatccc 



agggagtgag 
ccttctgaca 
ttgcttgcca 
tgaaaaataa 
tttttcactg 
aagacacata 
tgggacctgt 
agatcttgag 
atggtcctta 
actttgagag 
aacatgatga 
gcctataatc 
gaggttgcag 
tctgtctcat 
gaaaaaaggc 
tacctggagc 
gagcacagcc 
caactgctgt 
ctaatacagt 
ggtagtggct 
ccgggcatgg 
acttgaggtc 
aaatatgaaa 
tgaggcagga 
actgcactcc 
taaacacctg 
gctgtacaag 
tcatcgtgga 
agagagagca 
gcaaggacag 
ccacccccaa 
tatatcattc 
catgccttcc 
attcccaagt 
tgaaaacagt 
gttccaaaag 
cccagcagag 
catagccagg 
tgcggctttg 
ggcttttcca 
ggcggccccg 
ccaaccccac 
gctcccaggc 
ctcttgcact 
tggaaccctc 
ctggaacagt 
cctgtgcctg 
gttgtggaag 
tgtctcagtt 
ctccactgcc 
aggctgcaaa 
taattcattt 
ttgaatgctt 
agcactttgg 
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gaggcctagg 
aaacccatct 
tcccagctac 
agtgagccca 
aaggaaggaa 
ctcttatgtt 
gctaaggcat 
gagactcatc 
tttaaccagc 
tgcagtgagc 
tcaataaata 
caccatggag 
agctgcctct 
agcaccccat 
ccctcacagg 
acaccttttc 
ccacggcccg 
atgtgtgtca 
gatgaacaag 
ctgttagatt 
tagaactgag 
gttggtcact 
atgggtggaa 
agtctttttt 
cagtggtgcc 
ctcagcctcc 
atttctagta 
ccgccttggc 
cacaagtctt 
tccaggcaaa 
ctgatgctat 
tgggaggccg 
gtgaaacccc 
aatcccagca 
cctggctaac 
tggtggcggg 
gaacctggga 
gacagagtga 
ttgcaggtgt 
aactctcatg 
ttttttagag 
ctcactgcag 
gctgggacca 
ctggcaaagg 
ggtgcagtgg 
ggtaggcgga 
tgtctctaac 
gctactaggg 
gccgagatcg 
aaaaataaaa 
actgtaaaga 
ctccctggct 
tggactgctt 
cccggccccc 



tgggcagatc 
ctactaaaaa 
tggagaggct 
gatcatgcca 
ggaaggagga 
caaacttcaa 
aacgaaagtg 
accctggcct 
taatcgggag 
tgtgatcaca 
aataaataaa 
aaaggccagg 
gctcctcaag 
gacctggctc 
ctcttgtggg 
ctgatcatct 
ccagacagct 
tgcccccaac 
agagggacaa 
ctcaaggtgg 

ggggcatttt 

ctggggctgg 
cccaggacag 
tttttttttt 
atcttagctc 
tgagtagctg 
gagatgagat 
ctcccaaagt 
aaaccatgcc 
aatagatgaa 
taagaattga 
aggcgggcgg 
atctctacta 
ctttgggagg 
acagtgaagc 
cgcctgtagt 
ggtggagctt 
aactccatct 
gacaaatgat 
gaaagcatca 
acagggtctt 
cctcaacctc 
caggcgtgag 
tagcaaaacc 
ccctgacagg 
tcacctgagg 
aaaaatacaa 
aggctgaggc 
cgtcattgca 
aaaaaataaa 
aaaaataaat 
ggacagatac 
gaggtgctgt 
accaagggcg 



acgagatcag 
tacaaaaaaa 
gaggcaggag 
ctgcactcca 
agggaagaaa 
cagatcccta 
acctttgctc 
tggcttgtcc 
gctgaggcaa 
tcactgcagt 
tacataaata 
ccagctcctt 
gaggaacaga 
tggggccttg 
caccttggac 
gggaagggca 
gcagccacag 
ccatctccag 
gtccccaagc 
atggtgggct 
caggaataaa 
tggagtcagg 
gaacacagag 
ttttgagatg 
actgcaacct 
ggactacagg 
ggccaggctg 
gctgggatta 
tcagcacatc 
agacatagga 
gggctgggag 
atcacgaggt 
aaaatacaaa 
ccaaggcggg 
cccgtctcta 
cccagcaact 
ccagtgagcc 
caaaaaaaaa 
tctgtttctg 
tgctgttggc 
gctctgttgc 
ctgggctcaa 
ccactgcact 
ttaacaaatt 
gatggctcac 
ccaggagttt 
aaaaattagc 
aggagaattg 
ctccagcctg 
ttaatgaatt 
gtatatatca 
atcccacaac 
gctggagcac 
aggtccccct 



gagatcgaga 
ttagcttggt 
aattacttga 
gcctggtgac 
tgtcttcccc 
gggcatgaaa 
cagttcccaa 
atatcactgt 
gaggatcact 
ccagcttggg 
acttaagttt 
ctctctttct 
gggagtagga 
tgggtttatg 
attggtcaga 
gcttgtgcca 
ccttgccttt 
gggacacatg 
ctctctctcc 
aactagggca 
actgcaaaag 
tttccttctg 
caggccccag 
gagtcttgct 
ctgccttctg 
tggcaccacc 
gtcttgaact 
caggtgtgag 
aatgccattt 
tgattgatca 
cggtggctca 
caggagatcg 
aaatggccgc 
cggatcacga 
ctaaaaaata 
tgggaggctg 
gagatcacac 
ataaataaat 
tggcagaatg 
aacatcacat 
ccaggctgga 
gcaatcctcc 
cagcccaatg 
ttgaatctag 
atctgtaatc 
gagaccagcc 
cagacgtggt 
cttgaacccg 
ggcaaaagca 
aattaaaata 
gccaacaagt 
acctggaagg 
ggtgacagag 
gtggtgggtc 



ccatcctggt 
gtggtggcgc 
acctgggagg 
agaataagat 
ccagatgtcc 
ataatacagc 
taagttcctc 
cagcattttg 
tgaacccagg 
caacagagca 
atttaaagct 
gcacgtgttc 
aaggccatcc 
gattcccagt 
agcatgtggt 
gcgaggccac 
gatcagagca 
tcctttcttg 
ttcctgcctc 
accgaccatc 
tctggagcaa 
caggccccct 
gaccgggctt 
ctgtcatcag 
ggttcaagtg 
acgcccagct 
cctgacctca 
ccactgtgcc 
acaaaaaggt 
tgtcctgctt 
cgcctgtaat 
agaccatcct 
gcgcggtgac 
ggtcaggaga 
caaaaaaaat 
aggcaggaga 
cactgcactc 
aaataagaat 
ttctcaggag 
ttatttttat 
gtgcagtggc 
tgcctcagcc 
taccttcaat 
ataataaaat 
tcaacatttt 
tggccaacat 
ggtgcacgtc 
agaggcagag 
agagcgaaac 
aaataaaata 
gatggaatag 
cggctccatg 
gagctggacc 
tgagggaggc 



caacatggtg 
agacctgtag 
cagaggttgc 
ttgatctcga 
tgggtcatcc 
caaattattt 
atttccatct 
gtcacaatca 
aggttgaggc 
agatcctgtc 
gcatctttgc 
ctcccacctc 
caggaggccc 
gctgagtcat 
ccccgggaac 
ctgttcagcg 
aacaccagac 
ccaggcctga 
acccactccg 
ctggtttacc 
acaggagcaa 
ccccgcaagc 
gtcacttaca 
ggctggagta 
atccccctgc 
aattttttgt 
agtgatctgc 
tggccccact 
agagggattt 
aaacataggt 
cccagcactt 
ggctaacacg 
tcacgcctgt 
tcgagaccat 
tagccaggca 
agaatggtgt 
cagcctgggc 
tgttagtatt 
atctcttttg 
ttatttatta 
acaatcacag 
tcccaaagta 
atttacattt 
tatgaggctg 
gggaggccaa 
ggtgtaaccc 
tgtcatccca 
gttgtgatga 
tctctctcca 
atggatagtc 
agcaccccat 
tagaactttc 
atggacctcc 
atccgtatgg 
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cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 
ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggccccctga 
agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 
aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 
cagcaagggc cagaacaggg atggccctgc ccagggtcat gcancgagga agtggccagg 
ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 
tagagccctc cagctcactc atcctcggcc agctccatct cctcagcttg taaacccccc 
cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 
tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 
ttgagaccat tctggccagc atggtgaaac cctgtattta ctaaaaaaaa aaaaattaac 
tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 
aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 
tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 
gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 
ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 
caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 
cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 
ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 
cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 
tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 
ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 
tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 
cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 
cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 
ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 
gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 
agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 
ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 
ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 
cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 
tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 
ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 
tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 
cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 
acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 
ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 
tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 
acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcctcggg agcagtggac 
agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 
gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctct 
cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 
ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaatag cccttcaagg 
tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 
aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 
ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 
atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 
gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg 9gaggagggc 
ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 
acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 
tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 
gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 
gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 
cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 
tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 
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gatcttcctg 

cagagacctc 

gtgctgggaa 

gccccaggct 

agggccctca 

aacttctgcc 

gacttcgagg 

cctggtttaa 

tcgtgccgct 

cacccagctg 

cctctgctga 

tcaggactgg 

gggcacattt 

tgagcaccca 

tcagagaggt 

tgagatatgc 

tatgtagagt 

ctgacgatag 

atctccttgg 

tagtggctta 

ctcacccact 

agtgtccagg 

tgaaggtgag 

atggaatgta 

tagagtctca 

ctcgacttcc 

gacacacgcc 

cactttgttg 

tcccaaagtg 

tctgcttata 

aagaacaaaa 

tactttcagg 

tattttatat 

ttttagagac 

cactgcagcc 

ggactaaagt 

tgttgcccag 

aagtgctggg 

ttactaaaat 

ggcatctaca 

tgctgctgct 

aatcatttta 

tgaattttta 

cacttttcga 

caacatggtg 

ctgtaatccc 

ccatcctggc 

gtggtggtgg 

acccaggagg 

cagagtgaga 

cgtggcgtgc 

ccaggaggtg 

gagtgacact 

ggtcttccta 



cctcagcctc 

agttttctaa 

ttaagtaaga 

gggttagcag 

ttaccacctg 

acccaagttg 

cctcttcatc 

cacagcccag 

gtgcctcgca 

agagtcaggg 

gcacttccct 

gacagcttct 

tcccttggcc 

ggcaccccag 

gaagcgcctg 

acctgcggct 

caagagagtt 

acactcaggg 

aagaaaagtt 

ggaccctcag 

caccccttgc 

aaatgtgcaa 

gctgaaagtt 

tgtatgtatg 

ctctgttgcc 

tatgcccaaa 

aactatgcct 

cccaggctgg 

gagggattac 

aaattaatac 

agcgtatata 

cttttttctt 

tttattttat 

agggcctcac 

tcaaacacct 

gtgagtctgg 

gctgatctca 

attacaggca 

cccatggaca 

tcattatggc 

tagatttttc 

tcaaattccc 

cattaactta 

ggccaaggca 

aaactcggtc 

agcactttgg 

taacacggtg 

gcgcctgtgg 

cggagcttgc 

ctccatctca 

acctataatc 

gaggttgcaa 

ctgtgaaaaa 

tccaagagca 



ccaaagtgct 

cctgtgccag 

tgaccgggta 

cggtggtcag 

ggtgagtcct 

gcagattgtg 

ggacctgctt 

ctttcctcct 

gtgccactcc 

cccctgtgag 

gcatgattgc 

caggaccgag 

aacatcagag 

caggtagctc 

gcctggggcc 

ctgctcacag 

catgttaact 

atggcctacc 

ctgggcagaa 

cggtggataa 

cagccccaat 

atggcctggg 

ggagaaacag 

tatgtatgta 

caggctggaa 

tgatcctcct 

agctaatttt 

tcttgaaccc 

aggtgtgagc 

attcttattg 

atttcacagc 

tctaggtaca 

attttttatt 

tctgtcaccc 

gggctcaagc 

ctaatttttt 

aactcctggg 

tgagccacca 

ctttgacatg 

catctatcat 

tggcctgtct 

cttgcagaaa 

ggaaagggct 

ggtggatcac 

tttactaaaa 

gaggccgagg 

caaccccgtc 

tctcagctac 

agtgagccaa 

aaaaaaaata 

ccagttactt 

tgagcagaga 

aaaaaaaaaa 

caatatagct 



gggattactg 

caggaataat 

gcaaatatga 

ggctgctgct 

cgtccaagcc 

tgcaaagtta 

ccggtgactg 

gctgagccct 

ctgtaccctg 

gctctgccca 

ttctgagagt 

gcttcctggt 

actggacatc 

tgtaaccaac 

acagcctgcg 

ggtcctgcac 

ccctctcaca 

ctccccaaca 

ttccaccgtt 

gttgtgggca 

aagctcaata 

acagccctgt 

aagccagtgc 

tgtatgtatt 

tgcggtgaca 

acctcagcct 

tatttctatt 

ctagcttcaa 

caccacacct 

cagaaaagtt 

tcagatctca 

tatgcagaga 

tcattatttt 

aggctggagt 

aatcccccca 

ttactttttg 

ttcaagcgat 

tgcctggcct 

tctgtattct 

ctatcataat 

cctatttgta 

aagccctatt 

gggcacggtg 

ttgaggttgg 

atacaaaaat 

tgggtggatc 

tctcctaaaa 

ttaggaggct 

gatcgcgcca 

ataataataa 

gggaggctga 

tcgtgccact 
ttctgaagga 
tttcatgtat 



gcatgagcca 

gatagctgcc 

agtattactg 

tcctggcctg 

tggcactgct 

agtcctgact 

cattcgcacc 

ccctgggcct 

aatactttgc 

gcccgtcctc 

ccccccagcc 

ctgcttgcaa 

tgcagatctg 

ccattctgta 

tcagctgcag 

agactgctgc 

tccctcagcc 

acccccgtca 

ggcctggcct 

gaagagatgc 

agctgggctc 

gttcctttca 

ttatggtttt 

tatgtattta 

caatcatagc 

cctgagtagc 

ttttgtggag 

gcaatcctcc 

ggcctggaat 

tgaaaataaa 

ctgctattaa 

ttattttatt 

attttatttt 

acaatggagt 

ctcagccttc 

tattgacaga 

cctcccacct 

aaaatgccac 

atgctattga 

ccattttaac 

ttcttccaga 

ggattttggt 

gctcacgcct 

gagtttgaga 

tgccaggcgc 

acgaggtcag 

atacaaaaaa 

gaggcaggag 

ctgcactcca 

tacaaaaatt 

ggcaggagaa 

gtactccagc 

ttgagactct 

tcaagccttt 



ccgcacccag 

tagcttggct 

gacacagagg 

agctcgaagg 

gcgtgggaat 

ctgtggggtg 

tcctcctgtt 

gctgtcaccc 

cctgcctctc 

cgggtttctg 

tgtgagcttc 

ttttacaggc 

tgctagccac 

aagctgaggc 

agccaggagc 

tggagccacc 

agggtggggg 

ggtttgccgg 

acactctcct 

aatcaggatt 

ggtctgagga 

gtaaggttgc 

aattaagata 

tctttagaga 

tccttgcagc 

tgggactaca 

actgggttct 

tgcctcagcc 

ttatttgtat 

agaaaggaca 

catttttatt 

ttatttattt 

attttattat 

gatcatagct 

tgagtagttg 

ggtctcacta 

tggactccca 

tttttgtcat 

tctgactgtt 

attaaaattg 

taaattttag 

tgaaaaatac 

gtaatcccta 

ccagcctggc 

attggctcac 

gagatagaga 

ttagccaggc 

aatggtgtga 

gcctgggcga 

agccgggggt 

tcgcttgaat 

ctgggtgaca 

tagactctta 

ttcaatgcat 
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caacagaatt ttacagtttt tttcatgata tcctgctatt tcttataaaa tgtattccta 

gatattctgc atgttttccg gttgtttgtt aataaatatt tttcatttgt cattatttcc 

taattggctg ttatttgtat atatgacatc tgttgaattt tttgattact ttgaaaatgg 

ccattctttt gtgttttttt ttaactttct attttgagat aattttgact tacagaagat 

ttgcaaaaat agtacagaga gttcctgttt cccccttatg ttaacccagt ttctccttat 

gttaacatct tacataacta cagaacaatt gtcaaatcta agaatcaacc tgggcacaat 

gctattaact aaactgcaga agctgttcag atctcaccag ttcttctact gctccccttt 

tctcttccag tgttcaatcc ggaatcctac attatattta gttgtcattt ctctttggtg 

tcttccaatc tgtgacagtt cctcagtctt tctttgtctt tcatgacttt cattttttta 

tacttttgaa aaatactggc cggttgtttt gtagaacgcc ctcagtttgg gtttgcctga 

agttttttgt gattagatcg aggtcatgca ttattggaga gggtgccacc gcctcgatgt 

gcaagctcaa tgcatcatat cagagggttt gtaatgtcag tttataccgc cggagaccct 

aacctggagc atttcgtgaa ggtgctgtct gccaggattc tccactagaa agttactatt 

tttccctttt taattactga atgtctgagg ggaaatactt tgagactatg caaatatcct 

gtttctgctt taacttcggc tcactaagtt tagcattcat ctatggatct cgcttatagc 

aagtattact gtggagttct aatggtaatt ttctgtttct ctcattcctt caacctttat 

taatatgctt cttcctcact tattcatttt gtttcagttg tttataccaa catggatttg 

tggatattgg ttttattctt tgggttgcaa ttgaatccta tcattatttt gttagtcagt 

tgttccatcc gaccttggtc attaggagcc cttgaaattt ggctcccatg cctttttttt 

tttttttgag accgagtctc actctgtcac ccaggtttga gtgcagtggc atgatcttgg 

cttcctgcaa cctccgcctc ccaggttcaa gcaattctcc tgcctcagcc tcctgagtag 

ctggtattat aggcgctcca ccaccttgcc cggctaattt tttgtatttt tagtagagat 

ggggttttat tatgttggcc aggctggtct caaactcctg acctcaggtg atctgcccgc 

ctcggcctcc caaagtgctg ggactacagg cgtgagccac cacacctggc ctcctatgcc 

attttaacat gcccgtcttt tcttfcttctt tcctactttc tgtgactgta agaagctcca 

ggatacattt ttgctgccct agacttagcc tcaatcagtt ctcagaaaag ctctggttct 

ttttatggga tacttagaaa actagctctg tatggcctgg cgcggtggct cacgcctgta 

atcccagtac tttgggaggc cgaggtgggc agatcacaga tcacgaagtc aggagatcaa 

gaccatcctg gctaacatgg tgaaactctg tctctactaa acatacaaaa aattagtcca 

ggcgcggtgg cgggcgcctg tagtcccagc tactcaggag gctgaggcag gagaacggca 

tgaacccggg aggcggagct tgcagtgagc cgagatcggc agccactgca ctccagcctg 

ggccacagag cgagactccg tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 

ctctgtatgc tagttttttt tttaagacag ggtctctctt gccccagctg gagtgtagca 

gcacgatcac agctcactgt agcctcaacc ttctgggctc aagcaatcct cctgcctcag 

tctcctaagt agctgggtct acaggcatgc accaccgtac gtggcaattt ttaaaaactg 

tttgtagaga tggagtctcc ctatgttgcc tggtctggaa ctcctggcct caagtgatcc 

tcctgcctcg gcctcccaaa gtgctgagat tacaggcatg agccactgta cctggcctgg 

ccaaggtctg tcttttttta aaagaagttg ttgtatagtt gttttttttt ttattttttt 

ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 

actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 

ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 

tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 

ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 

atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 

cctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 

taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 

tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 

taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 

cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 

ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 

accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 

gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 

tacaggcgtg agccaccgcg cccggcgtcc atactttctt tagatatcct tcctttttac 

ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 
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cgtcttcaca 
cttgacagtt 
ctgagaccga 
gttcaagacg 
catcctctgg 
gaatgaactc 
ctgagaaaca 
atactcttaa 
atttaccata 
aaggataaac 
atcaaaacaa 
tttagatgga 
tgcaagctct 
actgcaggtg 
tcaccgtgtt 
cccaaagtgc 
ctttaatgcc 
ggttgaagtt 
attttgattc 
aagttgtagt 
ctgagttttg 
tcctaatact 
aataaatcat 
cgccatgtaa 
caaccacata 
cagtggacct 
tgggcagcaa 
ttgtgcccat 
gctgaaaaca 
aggtgggcta 
atcaagtttg 
aactggaaac 
agtcatgaaa 
taaacttgca 
ttcctagtcc 
tcccagagga 
ttcttcctct 
ataaggtctc 
ccctcccagg 
catgccacca 
agggctgatc 
ggattcgggt 
gtacagtgtt 
gatcacactg 
ttgtctcaaa 
ggaaagcttt 
aaggtgggct 
ttccctgggt 
atcacgttct 
tggctcatgc 
gaggatttgg 
aatacaaaaa 
gaggcaggag 
caatttcatc 



ggctcccctt 
ttgaggagga 
tacatgatac 
aagttgctgg 
agaagaagaa 
cctttgccaa 
tcaagcttta 
aacgttttga 
tcaaaaatta 
ccattacatg 
aaattagtga 
atcttattct 
gcctcccagg 
cctgccacca 
agccaggatg 
tgggattaca 
tgccttctca 
tgtaaagaaa 
tctcttcaaa 
aaatacaggt 
taacattaaa 
ggaattagta 
gtgatactgg 
tacattggca 
cttacagagc 
tggttcttgc 
ctggaccatc 
gcgggttaag 
agcctttgag 
acccgaaatg 
agcatttcgt 
aaacagatca 
gcagactcta 
tgggcaggaa 
ccagtataca 
tgaggctagg 
ggaagtccta 
gctctgttgc 
ctcaagtgat 
tgcttggcta 
tcaacctcct 
gcacaccacc 
agaaatggat 
tgaaccatga 
aaaaaaaaaa 
atagtggaca 
aactagagat 
gatcctaagt 
ggcagtagtg 
ctgtaatcct 
gaggtcagga 
ttagctgggc 
aatcagttga 
tcaaaaaaaa 



gacgggaaga 
ctggtatctt 
atgaaaaaaa 
ttggtttggt 
tgcggtgtcc 
gccattttat 
aagcactggt 
ggatcccaaa 
aactgaaaaa 
ctaacataaa 
gaagagtggc 
gtcgcccagg 
ttcacaccat 
cgcccggcta 
gtcttgatct 
ggcttgagcc 
tatttgtttc 
atgtggcctc 
caactatgga 
taggtgtagt 
gtccagttgc 
tcagaagtgg 
ctcagaagtc 
accatttgat 
ctgtagacat 
cagttgcatt 
ttcaagtaaa 
aaaaatccag 
tggtaaaggc 
aaatgagaag 
ttaggagagt 
gatgtctacc 
tactgattag 
gtgggctgtc 
cttcagatgt 
gcttcatttc 
tcatcgttat 
ccaggctgga 
cctcctgcct 
tttttttttt 
gggttcaaga 
atgccaacta 
tgcttgcaga 
tcgcacccct 
aagagagaga 
aaaaggaacg 
agggggtact 
tggaagcagg 
gactggttgt 
agccctttca 
gttagcctgg 
gtggtggtgc 
acctggggag 
aaaaaaagtt 



gttcctcaga 
agtctgtttt 
atgtattctt 
ctctggtttc 
tctcactgca 
aatgggcatt 
tctcaacctt 
aaaaggcttc 
attttaaatt 
tcatgtattt 
atgtataatt 
ctggagtgca 
tctcctgcct 
attttttgta 
cctgaccttg 
actgcgtctg 
tgcattcaag 
atacagttgt 
caacctgctg 
gtggaatctg 
gttaagctac 
ggtgctactg 
aggcagtagg 
ccagctgtct 
aggggaaaat 
tagccaaata 
agtgaaaggt 
ttgcttctag 
cgattaacac 
cctgtggtga 
ttgggctctg 
acttcttcga 
gcattaaaac 
aaagctgttc 
ggccctggag 
tccaatgacc 
tattattatt 
gtgcagtgac 
cagcctcctg 
cagtagagat 
gatcctccta 
atttttaatt 
ggcaggagga 
gcactccaat 
gagagagact 
ctctaagtct 
atgtggttga 
gacaaaaatt 
gatagaagtt 
gagttcaacg 
ctaacctggc 
atgcctataa 
gcggaggttg 
atcgtttagc 



ctttccttgt 
gtgctgctat 
acagttgtgg 
aagatggcgc 
gaagatggaa 
aatccacaaa 
tttggtctca 
tacaggttcc 
atttattcat 
tatgaaaaat 
ttttttgttt 
gtggtgtgat 
cagcctcctg 
tttttagtag 
tgatccaccc 
gcctaaattt 
ttattgcaaa 
gtagttggaa 
ttacaaaacc 
aaagcatgtg 
tgtgatagca 
ttaataaata 
ctgtgtggaa 
gtcatgatga 
agtataaaac 
ttaaacaaaa 
aataaacaga 
acaccgtata 
tcagcgcggt 
tgaggaggca 
attacttgca 
gggaattgca 
aaaaacaatc 
atcctctaag 
gacactggac 
tcagctgcct 
atcatcattt 
atgatcatgg 
agtagctggg 
agggctctca 
cctcagctcc 
tttttttgta 
tcacttgagc 
ctgggcaaca 
caaagatagg 
gccctattgg 
ctatgggtgc 
agggaagctg 
attgttttgg 
tgggtggatc 
gaaatcccat 
tcccagctac 
cagtgagcca 
ttcctcgatt 



ttttgttgac 
cacagactag 
aggctgggaa 
cttgctgctg 
gcgctaaaag 
ggatgaaacc 
ggagcccttt 
atcttttaat 
ttaaaataac 
agctatattt 
attttttgtt 
ctcggctcac 
agtagctggg 
agatggagtt 
gcctcagcct 
ttgtgaatgt 
atgttgtgtt 
aggcaagagt 
agaatgcaaa 
aatgaacttt 
tatagcattg 
aaaagaaata 
cctgacatca 
cttggaaagt 
agaatactaa 
gagatattct 
gtccagacat 
tgaaaacaac 
aacaaagacc 
gagaagtaaa 
catgcaaacg 
ttgccaaaga 
tttaggcccc 
gtggacctag 
atggaggacc 
ctatttcccc 
ttattttgag 
ctcactgcag 
agtacaggca 
ctatgttgcc 
tgagtagctg 
tggacaggat 
ccaggagttt 
gag t gaga cc 
caaaaaagtg 
catggtgctg 
atcttfcggct 
ttagttattc 
ccaggtgcgg 
aggaaggagg 
ctctactaaa 
tcgggacgct 
agatcgtgcc 
gttactggac 
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gtagtaatct 
tgtagataag 
cgtaatcaag 
aaaaaatcag 
acacgtctag 
agtgaccata 
accgtgtttt 
accccatgta 
ttcaaatagg 
actgtggggt 
aggagcgggg 
gagttcaaga 
ttatccaggc 
cagtgagccg 
aagaaagaaa 
ttttgggagg 
acggtgaaac 
cctgtagtcc 
agcttgcagt 
tgtctcaaaa 
atgacaaata 
aagagagtgg 
tgggcagatc 
tctttgctga 
cacttgagcc 
tgggtgacag 
aaatctgtaa 
tggccaacaa 
gcatgtgcct 
tacgtcgtta 
aactacagtg 
gtagatactt 
cccctcctct 
tctcgaggcg 
ttgggttttt 
tctcgtggtt 
acccccccaa 



ggcttcctgc 
gcagtaagta 
cagaagattt 
taaggatgta 
aaccctgcag 
tggtggacca 
ctgaccgtaa 
tctggaaatt 
aatttagaaa 
gcagctgaac 
aggattgaaa 
tcagcctggc 
gtagtgaggc 
cgattgcgcc 
aaaaaaaaag 
ccgaggtggg 
cccgcctcta 
cagctactca 
gagccgagat 
aaaaaaaaaa 
aaccaaaaaa 
ctgggtgcag 
acttgagccc 
aaataaaaat 
taggaggttg 
agtgggaccc 
tcccaccact 
gatgaaacct 
gtaatcccag 
ttggactgaa 
gcttccgacc 
ccttccttct 
ttatcttttt 
gtcctttcct 
atctctcacc 
tctcttcctc 
gcgctttgcc 



aagtctaact 
aagcaaaaat 
gaacacttca 
gaagatttga 
aacacacact 
taaagcagtt 
gtacagttaa 
ttaatataca 
taccttgaac 
agtatataga 
gttaatggga 
taacagggtg 
tgaggcaaga 
actgcacccc 
aaaaggccag 
cggatcacga 
ctaaaaatac 
ggaggctgag 
cgcgccactg 
gttaatggga 
aaaaaaatca 
tggctcaggc 
aggagttcaa 
aaaaaattag 
aggctgcagt 
tgtctctaaa 
ttgggaggcc 
catctctact 
ctacttagga 
ttcagaatga 
ccactctggt 
ttctcacttg 
ctcgcgacac 
ttatccagct 
ctccctctaa 
tactctcggg 
ttttttttct 



ttcagcagac 
tgatagagca 
ctttcagtaa 
acaacgtaat 
ttttcaagca 
tcaacaaatc 
cctagaaatt 
ctttgaaata 
tgaaaaataa 
cgaaaagtat 
ggcaaaagca 
aaaccccatc 
gaatcgttgg 
agcctgggag 
gcgcggtggc 
ggtcaggaga 
aaaaaaatta 
gcaggagaat 
tactccagcc 
taaacatcca 
aaagaagaaa 
ctgtaatctc 
gaccagcctg 
ccaggcatgg 
aagccatgat 
aaactaaaat 
aaggctgagg 
aaaaatacaa 
ggnnnnctnt 
tgactctcat 
tttcacttca 
ttgctcttcc 
ctgcgttctc 
taaatacctt 
tttctttcct 
tcccccccct 
ttgcccttta 



tggctacatg 
tcaaggataa 
ctgataaaac 
taacaaactt 
tactcagaac 
tcacaggagt 
gaaaacaaaa 
acaaatggat 
tgagaatact 
actcatatgt 
ggtggatcac 
tctactaaaa 
aacccaggag 
acagagcgag 
tcatgcctgt 
tcgagactat 
gccaggcgtg 
gtcatgaacc 
tgggcaacag 
tctcaagaag 
atcataaggt 
agcattttgg 
agtaacatag 
tggtactgag 
tgtgccactg 
aaggctgggc 
tcagcagttt 
aaaattagtt 
ngattatatt 
tggagctctt 
cccctctgct 
tcaacccccc 
ttgccctctt 
ctcctctgtt 
ctttccgcac 
ctccccttct 
ttcccccc 



ggctgtgtac 
atagaaaatc 
aagtagacaa 
gacttgattt 
atttatataa 
aaaataacag 
agctagaaaa 
cagagattaa 
ataccccaaa 
gcatacctta 
ttgaggttag 
atacaaaaaa 
gcagaggttg 
actccatctc 
aatcccagca 
cctggctagc 
gtggcgggtg 
caggaggcag 
agagagactc 
ttagaaagga 
caagactata 
gaagcagagg 
agagacctca 
gtgggaggat 
cacttcagcc 
gcggtggctc 
gagaacagct 
gggtgtggtg 
ttctccttcc 
cctgtctcct 
gctcatacga 
ccgttggtgt 
atcatccctt 
tatttggggg 
ccatcaagcc 
ttttttcttc 
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<210> 7 

<211> 29430 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> (4336) , (4345) , (4349) , (4392) , (4447) , (4490) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 7 

aggggaaggg ccggctccgt agctcacacc tataatccca gcactttccg aggagagagg 60 

atcatctcag gccaggagtt caagaccagc ctgggcaaca cagcaagacc gcatctctac 12 0 

aaaaacttct tttaaagctt aaaaaaaaaa aaaaaagcaa agaggacagt tcaggagaaa 18 0 

agcctgtaga ggcagcacac taaggaggag acgcagccca ggcaccagga ggggctggcc 240 

atgggcactc actcctccag caggcgagtg cccagcacca gctggcccac ccagacaccc 3 00 
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aggacacggc ctgaatggct ccgtattcac gtgggtggta ataaacaagc aatacacata 360 

gccaataagg acaccttagt aatgttacat cataaacgct gcagatcagg gaaatggtgc 42 0 

agggtgaagt gggttggggg gctgcatgct acatgagaag tgggtcgggg ggctgcatgc 4 80 

tacctgagac agagcaggcc ttgctgggaa agaaggagcc ggcaggcctg ggcaaaggtc 540 

ctggggtggg agcacactgg agcagagtgt gggggtagca tggcgggtgc tggtcctctg 600 

ggcgccttcc caccacgtca tgtgcccatg tgcccaaggt ctctcgtttc acagccccct 660 

gaagctcagg ggtcacagct acacagcccc cagatacctt ggcctgcccc aggtcattcc 720 

atccagtgat ggacctgctg acctctagcc tgacctctgg gcagcgtaat ttgagaagga 780 

ggagaaggga gggcaacaga cctggggcga tgagggatgc acagggtggc agacacctga 84 0 

ggctgcacct tggagcctca gttctgggtg tgggtggggg atggacaggc tgagggctga 900 

agcagctggg cccggccacc atcacacccc aggacccacc agatcaccat gaaaaaccga 960 

atgfccaactg gcagcccaga gtgcagaaca aacctttcag aaacacggtg gtgactgccg 102 0 

catcatgaac ataaaataat tacgccctct ccccagggat cacccctgca ggagtttgtc 1080 

ccaagaaaca ccagaaagaa ggaaaacgtc tgagtcacaa tatttgctga ggccttattt 114 0 

gtaatagcaa aaaaaaaaaa aaaaaaagaa caatctccag cggcaggggt aactagacta 120 0 

ttgtctccgt ggaaaggtag caccaattaa ctagtaacaa aatgactgcg gtaacaacaa 12 60 

aacgttcgac atgtcaacac caaaaaccac acacccagca taaccgtgaa ccatgatttc 1320 

tactagaatg aatggcagtt atgagaaagc accagcggag acaaagattg aaaaagtaaa 13 8 0 

ggtggcctca ttagggagac aagtctctgg gtaatatatt gtaatactgg taaatatata 144 0 

gtttttaata tattttttaa ttccaaattc catatatgtt cctatgaagc tatttctgca 1500 

aatatttttt tcaggaccgt acatcacaaa ggcaaaaggg ccaggtcagc tctccagctg 1560 

agagtgacca cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 162 0 

gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680 

tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740 

caagaggtaa cagccgggag ggggtagtgg ttttcattaa ttaaatgttg ccttcatcca 1800 

gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 1860 

caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 192 0 

ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980 

tattttgaga cggagtttca ctctagttgc ccaggctgga gtatgatggc gcaatctcag 2 040 

ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 210 0 

ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160 

gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220 

ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 228 0 

taaagacacc tctggctgct gtggaaaaca gcctggtagt gcctcaaaaa gttacacata 2340 

gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400 

ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 2460 

aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520 

caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 25 80 

tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2 64 0 

atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700 

aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 2760 

ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 282 0 

cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2 880 

agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2 94 0 

ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000 

tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaaggcccga 3 060 

cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 312 0 

aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3180 

cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 324 0 

caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacatt gcaccattat 3 3 00 

actccagcct gggtgacaga gtgagattct gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 3360 

ctaaacaaaa gcaaaaaaac caatgagtaa tgttgtcaag tgaacttcat cccaatggga 342 0 

atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480 

cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 3540 



-31- 



BNSDOC1D: <WO 02092000A2_I_> 




WO 02/092000 





PCT/US02/14877 



cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 
cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 
ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 
gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 
tttgaaaacc caagtgccaa caatctcatt ttaacccatg taagcatatc caatatttat 
atatagaatt cataacagat gtctgggctt ccattccaat agcctatatt ttacactgtt 
tatttacatg gttacaccaa acaagactca attcaaggta acccaatcct ttgctactat 
accaaaataa gcaacatttt cagtccatgc cttatatata ttcaccaagc attacactag 
gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagtc 
aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 
gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 
acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 
cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 
gaagacctgg ctgtangtgg ggtangcang ctttctaaat ggggaaaatc tggctgtggg 
tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 
ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 
tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 
ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 
caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 
aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 
gagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 
acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 
agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 
gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg 
gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 
aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 
gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 
aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 
aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 
ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 
ggtggcagtg gttaccagca acatgaatgt tcttaatgcc actgaacttc acacttacaa 
atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 
ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 
acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 
aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 
tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattgtacc 
attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 
aaaataggta aatgcaaaca tatggtatag taatattatg ggctattatg agctacaaaa 
aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 
ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 
gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 
tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 
agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 
cccagacctg ggctctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 
gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 
tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 
tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 
gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 
ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 
ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 
ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 
cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 
ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 
cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 
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gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840 

agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900 

caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 6960 

cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7 02 0 

tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 70 80 

catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 714 0 

aaaaaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200 

cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 72 6 0 

gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 732 0 

tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380 

tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 744 0 

tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 7500 

atttctgtat ttttagtaga gacggagttt cgccacattg gccacgctgg tctcaaactc 7560 

ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7 62 0 

tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7680 

tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7 74 0 

tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800 

tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860 

agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 792 0 

gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7980 

tatttatttg tttgagatgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8 04 0 

tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8100 

aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160 

agatggggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccac 822 0 

cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280 

aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 834 0 

actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400 

ctgtcttcct atccgtgaac atctactgca gtcctctcca atgtccttct gtaaagttgt 8460 

attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520 

taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8 58 0 

ttctcacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 864 0 

aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700 

caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 876 0 

caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8 820 

tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 88 80 

atattcattg ctgggccatt aacaacattc ttggcaaaac cataccttag cttctcgtgg 894 0 

aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 90 00 

atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060 

attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120 

ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180 

tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 

aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300 

tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 93 60 

tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420 

tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 9480 

ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 954 0 

gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 

taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9 660 

ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 972 0 

accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780 

agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 984 0 

ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 

gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 

catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 1002 0 
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ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 
agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 
ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 
catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 
gcccagtcac ccagacagca ccccactccc cacacacacc tcacatctgc ccgcctcaca 
ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 
cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 
ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 
gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 
cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 
caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 
acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 
accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 
tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 
ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 
acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 
ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 
ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 
ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttctctcc 
cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttccac tcaggaaaca 
tattttgaaa aacactggcc tagaggcaga agtgaaatgg aaaacacaaa agtaaaactg 
aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 
ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 
gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 
caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 
tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 
caggtctctg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 
gaccccaggc aagtccttgc ctcccagacc cagaacccca tggagcccac cagggctggt 
tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 
ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 
tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct t cc tgagaga 
aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 
ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 
cagaagctca ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 
tatctctctc acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 
gttagcaact gtggaatgtg accagctcag tgatcccagg acaagctgct agggaatatg 
acatttgatt gatgtctgca aatgtgcgtt ttcactaatt agaaggttta gggcagagca 
gagaaaaata tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 
attcttattt tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 
tgccttgcca agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 
cagggtggag actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 
agcagtcatc accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 
ctcaggttca tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 
ctgttccctc caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 
acaaggaggg cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 
atggaagaaa acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 
tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 
ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 
aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 
atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 
tacatttatt taaaaacaag gattttcatt aaacacgtac cccacagaca gcaaccccat 
ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg 
tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 
tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 
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gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320 

cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380 

gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 1344 0 

gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13 500 

ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560 

tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13 62 0 

cggctgggct ggcacagact cgtggtttcc atcgaggtgg gaggaggtgg gacgtcccag 13680 

cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740 

ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 13 8 00 

ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13 860 

cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13 92 0 

catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980 

ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14 04 0 

aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 14100 

ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160 

aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220 

agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 14280 

tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14 34 0 

tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 144 00 

gcaggtgcac aggacgaaac acattccgtc cgggggaaac ctgcaggaaa gctccctctt 14460 

cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520 

tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580 

aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14 64 0 

tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700 

agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760 

tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 1482 0 

gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880 

atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940 

ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000 

actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15 060 

ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120 

acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180 

tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 1524 0 

ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300 

ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360 

cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 1542 0 

gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480 

ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 1554 0 

atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600 

tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660 

ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 1572 0 

ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 1578 0 

agcagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840 

cgggaggagc cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 1590 0 

acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960 

cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020 

gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 160 80 

ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 1614 0 

gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 1620 0 

ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtccag agtccccggg 16260 

gggtcgtcac tatggagtaa caattgcgat gcgatggtaa ccctaacagc taaccgtcac 1632 0 

tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 163 8 0 

ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 1644 0 

aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500 
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cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 
ccccgaccct gcacctgcag ctcgctactg ccccctcagt gactccagca aacctcgggg 
taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 
actgctgcat aatggttgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 
cttccagcat gccttggtga ccaagctggc cttaggtgct gtgggcagcc aagaatagaa 
cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 
gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 
cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 
ctgaacctgt ttctcacctg aaacacatca taccattgga ccccagccct ccgggagagg 
ccctaatccc tgactgtggt gagatcagat cactggttaa gtacccagaa gggccttggt 
caggggctcc aggggtgggg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 
accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 
gacccccaga gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 
acagcaagga ggccgcccag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 
ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 
cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 
ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 
gctggacgat gagcagaagg ggctgttgct taataaacgt catttcctta agaggataaa 
acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 
gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 
gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 
tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 
acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 
cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 
cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 
caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 
ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 
gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 
agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 
gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 
aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 
cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct 
cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 
tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 
ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 
atggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 
tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 
aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 
agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 
aatcccagca ctcagggagg ctgaggcaag tggatcacct gagtcagaag ttcgagacca 
gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 
tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 
aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 
acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 
aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 
tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcccccac cgtggcatct 
cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 
agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 
ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 
cctaggctag agggagctca tgctccagca gaattcgagc cagaggaggt aaaagatggg 
taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 
agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 
ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 
ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 
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acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800 

acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860 

aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 1992 0 

tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980 

ggcaggggac agagctgggg cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040 

agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 2010 0 

cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 2 0160 

cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220 

aagcttcagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280 

cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340 

gatagctatt aatgccccgt tgagcaattt cctgagagct ttgccaggca ggtaccgcct 2 04 00 

ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 20460 

gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 2052 0 

gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 2 05 80 

gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640 

tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700 

tccacagcct gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 20760 

gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820 

cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 2 08 80 

aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 2094 0 

atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000 

gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 210 60 

gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21120 

cccctacctc ataccatata caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180 

ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 2124 0 

aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 213 00 

attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 213 60 

ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 2142 0 

aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 214 80 

gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 2154 0 

gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600 

ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660 

ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 2172 0 

cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 217 80 

gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 2184 0 

taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900 

caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960 

atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22 02 0 

gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22 0 80 

ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140 

tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200 

actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260 

ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 2232 0 

acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 223 80 

ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440 

tgctttgtaa tagcaatgaa ggctggaaaa actcctcaat caggtaaatg ctgtcaagtg 22500 

cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560 

gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 2262 0 

ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 2268 0 

aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740 

acactttggg aggctgaggt gggaggatcg cttgatccca ggagttcaag gccagcctgg 22 8 00 

gcagcaaagc aagacaccca tctcaacgac aaattttaaa aaatcagcca ggcaggctgg 22 860 

gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcactt 22 92 0 

gaggtcagga gttcgagacc agcctggcca acgtggcaaa accctgtctc tactaaaaat 22 980 
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acaaaaatta 
tcgcttgaac 
ctgggcgtga 
cacggtgggg 
agcgatccag 
agaccataga 
agaaggtgag 
gggccctgga 
ctcaggagag 
aaaccacttt 
aaggcccagc 
gatcgcttga 
caaaaaacaa 
ggctgaggtg 
atcactgcac 
gggtgtggtg 
agatggagac 
tagccgggcg 
aaggcgtgac 
cctgggcgac 
acattaaggc 
gagcaaatac 
gggtccacag 
ctctggccca 
gtcgcccagc 
cccctacaag 
gaccaccaca 
agcagcgcgt 
caacccagga 
ctgctcgtaa 
aaatatcttc 
gtcacaccct 
cccccagctc 
acaccatcgt 
ccagagctga 
tcgtagcaag 
ccctccaggg 
caaggccacc 
gactccccat 
gctgtctcac 
tcagggccca 
ctggctcgtt 
ccaccaggat 
gatcaatcca 
attattcagc 
cagcactttg 
ggccaacatg 
gggcacctgt 
ggcagaagtt 
gactccattt 
actcctgaaa 
cttcatgcaa 
ccaggggatc 

ggggtgatgg 



gctgggcatg 
cagggtggca 
gtgagactcc 
tgagggaggg 
aggcctcagg 
gattggaatc 
tgcctctcaa 
ttcttgcaat 
gagccgttgt 
cctccttgag 
gcagtggctc 
gcccaggagt 
caaaatcagc 
agaggattgc 
tccagcctca 
gctcacgcct 
catcctggct 
tggtggcggg 
cctgggaggt 
agagcaagac 
aaactaaaag 
gaaaataccc 
caagagaccc 
tcaggctcta 
agccacgggc 
gtgacaaact 
tggggctggg 
atgcgcatgc 
aggcagcccc 
ctcagggctc 
acccacctcc 
cggggtactc 
cctcacccca 
cccgattttg 
agagaatgca 
ccacagctgc 
cggcctggca 
aagccgagga 
ttggagagcg 
ggcttctgta 
ggcccctcct 
cctgaacctc 
gcaggaggca 
atcagcggat 
cacaaaaagg 
ggaggccgag 
gtgaaatccc 
aatgcaagct 
gcagtgagcc 
caaaaaaaaa 
acgttacagt 
aatgcccaga 
cggggagagg 
aaatctctta 



gtggcagatg 
gaagttacag 
tgtctcaaaa 
cacagaagca 
atcctgaagg 
agctggttct 
tgccccacag 
ggggtggagt 
catatccttg 
tgaaccttca 
atgtctgtaa 
tcaagaccag 
tgggcgtggt 
ttcagcccag 
gcaacagagc 
gtaatcccag 
aacacggtga 
tgcctgtagt 
ggagcttgca 
tccatctcaa 
atgtttaaaa 
agaaaacaca 
caagcacgag 
ttagaaaata 
tctggcacac 
aggacaaact 
gaatgggcag 
ctctgcatgg 
cactacccag 
caggccccct 
tggtccccat 
ttcaaaggac 
gacaccagct 
cagatgagga 
gagagcagcc 
ctccgcccat 
cagagcccac 
cctggatgta 
cattaagtgt 
acctgaactt 
ctgagatgcc 
ggcaggagag 
tgagacctgc 
gaactgacaa 
gctgaaatag 
gccggcagct 
gtctctacta 
acttgggagc 
aagatcgcac 
ataaaaggct 
aaccaaggaa 
acaggcagaa 
gaacgggaag 
taacttgata 



cctgtagtcc 
tgagccgaga 
aaaaaaaaaa 
gcgcctcttc 
gagaaaaaac 
attttacaaa 
ggaggcaggg 
ggagcctgtg 
aaggggtcct 
cctcagttta 
tcccagcact 
cctgggcaac 
ggtgcacacc 
gaggtagagg 
aagactcaaa 
caccttggga 
aaccccgtct 
tccagctact 
gtgagctgag 
aaaaaaaaaa 
tatatatatt 
atccccgcac 
agacagagaa 
aggctcttgc 
cacgagtcag 
ggaaaccaga 
ggacctcaga 
cttagggaca 
gtagggaacg 
cgggggcaac 
ctggacaagg 
agctggatgc 
cttgggaccc 
aactgaggct 
ggagccagcc 
cacacttgga 
accctgctgc 
gccccggctc 
ttccaaagcc 
ggccctcact 
agcactgact 
tctcaggcca 
tcgtgccggc 
atataatgtg 
gccgggcgtg 
cacttgaggt 
aaaatacaaa 
ctgaggcagg 
caccgcactc 
gaaacaccca 
gtcagccacg 
ccacagaggc 
tcaccgtgta 
gaagagaggg 



cagctactga 
tcgtgccacc 
aaaaaaaaca 
tgggggcacc 
gtgaagctcc 
aaaaggaaac 
agagggctct 
ccgcccccac 
tgagcccctc 
accacaagaa 
ttgggaggct 
atagtgaaac 
tgaggtccca 
ctgcagtaag 
aaaaaaaaaa 
ggccgagcgg 
ctactaaaaa 
caggaggctg 
atcacaccgc 
attaaatctc 
aaattaaata 
ccccaggaca 
cagtgtccct 
cactgagaga 
gccagcaaag 
ggctggacct 
gcgccaccca 
cagggagctc 
gataggacca 
cagcacagag 
gtgctgggga 
cccagggcag 
caccagcatg 
gagggctggc 
ggtgggtccc 
cctcactggc 
ttcctgaaca 
ccgccagggc 
tcacaaacca 
ctgccctccc 
ccccaacttg 
gatcctccca 
tgggagatgc 
gtccctccac 
atggctcaca 
caggagttca 
aattagctgg 
agaatcactt 
caacctgggc 
tacgtggtac 
aagacgcatt 
agaaagtcga 
atgggtatgg 
ttgtaaacac 



ggcacaagaa 
gcactccatc 
aggagccagg 
cccaatctct 
gtgctagaag 
tgaggccctc 
gagccctgca 
caggcacctt 
aaaaggctaa 
aaactacatt 
gaggtgggtg 
cctgtctcta 
actacttgcg 
cggtgactga 
aaagcaggcc 
gaggatcagg 
tgcaaaaaat 
aggcaggaga 
tgcactccag 
aaaaaaaatt 
cactccaata 
acctcccagg 
gtggcggaac 
aagaggcaca 
tgtcaactgc 
ggagcacagg 
catgcctaag 
cccccacccc 
gcaccccgtt 
ctcagacccc 
ctggctctca 
gagcttttgg 
ggcaaggtgg 
acacggctct 
tgaggccggc 
cccaggacag 
aataagtgaa 
ctccccaaca 
cagatgtccg 
agcactcctc 
tccccatcac 
ccagccacct 
aaccaaccaa 
acaatggaat 
cctgtaatcc 
agaccagcct 
gcgtggtggc 
aaacccagga 
aacagagcaa 
tacttggatg 
gtaagattcc 
ctggtgttca 
gttttatttt 
tgtgaatgta 
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ccaaatgcct gccttctata ctttaatatt ttatattata taagtttcac ctcaatttaa 26280 

aaaaaaaaca actcgacacc tttcacctag gaaagatctg gctttagctt gcatttcctg 26340 

taactcctgc ctaaagcctt ccagaagctt ccgctgcctt gtggatcaca accagactcc 26400 

acaccatgat ctggcctcta agggcctctc gcaggacacc ccgagggtga aggagcaccc 26460 

gtgggcccac ctctgcatag ctgcaaagct tctttccctg tcctcccctc tacatgggaa 26520 

gctctgcccg caggggcggg gccttatctg ccattctatc gcactcaacc ctagcacttc 26580 

actcggtagc agacaccaaa gcaaaacagc aacagcatta taccgggcca ggtgcacgtt 2 6640 

aactcactga attcatggta ggaaggattc tattcccatt ttacaggtga gaaaactgag 26700 

gcacacaaag gtagcatcag cttcctaagc ctcccagcac aggaagcggc caggctggaa 26760 

tcagaccctg ggcgcagggg ctctgtccac agtgctaact aactactcct gcccccgagg 26820 

gctgcagcgg tgagtgagtg agtttgtcag tggactggat gtccaaggtc atacaggaaa 26880 

aatccagact attgtaataa cagcctctag accggctggg gccagaaaga tcgaggacgc 2 6940 

tgacacacaa ctgcgctcac tgcagctctg ccagggatgg ggctaaaggt ctcacacagg 27000 

gcagttaggg ctccccatag cctgggagag gaacggggtg agataacaga aactaggtat 27060 

ggtgcccgaa gtcaaacagc cactgagcat gtaaacccag gtgggtctga ccccaaaccc 2 712 0 

ctccaccccc atcagccctg caacccgtcg ctgcaaggga gaaagcaact cagaggcctc 27180 

acctgcctac atcccccacc cgtgtgtgtg agttctacta aatgcctgag cagtgacaca 27240 

gcacggctga aattaaacgg gttccaaaaa cgacaggaag cacgaagtga atctccccag 27300 

gaaagtgctg aacaaatgct ggatcgggtt caccggcgaa tttcttggaa ctgaagaggg 27360 

gagctaaaca cacggggccc tgctttggag gggactctct cagggtgctc cacacagcac 27420 

ttggttaacc ccactcagcc cttctgggct ctcccagagg gcccggcctt ggccttgggc 27480 

atctacagga ggaacctcca gggggagagg gggtgcctgg acaggccggc cctggaacaa 27540 

gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 27600 

ggaccactag aacaaagcca tttccgtgcg ttcacagctt ccaattgcaa caggaagcaa 27660 

tcaggaaaaa taattagcgg cccacttact ggcttcgctg aggtccgagg catgtatttc 27720 

acacagtaaa accagggata taacatcaaa accgttctgc agaaagattc ctccctttcc 27780 

ttccatttta ggcctggatc accacattca ctggggctcc caggccttgc tgcctaatgt 27840 

taaaataatc aactctattt ttgcctcaca cacaactgaa ctctacagct ataattcttt 27900 

ctcctcaggg gctcgaacca catggacgac aggcatttga ctccagcaac atcaccccaa 27960 

aacgtgcaca aaacccaaaa ctgcaatgag gtgaaaggca acgcggtcgg cctagaaacc 28020 

ccccctttaa aacaaacagt ttccccaaaa ccccttttgc ctccttgacc caggcatttc 28080 

cggaaaaagg agcggcgctg gcctgtactc cccagatact gtcgctgttt tgtcttcacc 2 814 0 

ttgttttgct agctccagac aaggccccac aatgtaaaca cgctcctgaa agaggcagat 28200 

ttggggtgaa actgtccata gaatctctag gcttgggtca gaggcaggag gacgtgaaac 28260 

aaactccaag ctcctcctgt tccccgctgt cccccacacc tccaagcaga ggctgcagcc 28320 

tgggggatct gactacaggg ccaccccgct gcaccattca cactggaaat attcagggag 28380 

acagctgttt gccttaagga ggcccagaca aaggggcccg aggtcctccc cgctaaactg 2 844 0 

ccacaaacag aacaggagcc gcggcgtgca caggcacttg cggccgtgcc acttggccag 2 8500 

ccatactcca gaaaaacaaa acacgcacat ccgaagagaa tgatttaggt agcaagaggc 28560 

ttgcttgaaa aaccacatgg caatctccaa attaaaagaa catgtgtagc gtttcacgac 28620 

tgcttaagtt tcctgagtcc tcctgacctc aactccaccc cctgggaaac accaaaagtt 28680 

ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 28740 

ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 28800 

atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 2 8860 

caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 28920 

taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 2 8980 

cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 2 904 0 

cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 2 9100 

gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 29160 

cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 2 922 0 

cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 2 9280 

gggcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcc 2934 0 

cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 2 9400 

acgccgggcg cgggaaggta tcgaatcagg 2 943 0 
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<2X0> 8 
<211> 33769 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 

<222> (33739) , (33749) , (33758) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 8 

cttcccctta 

gaggctctcg 

ggcaacgtgg 

gacggctcat 

gagggatggg 

acccagcggc 

gtgtgcagga 

caggtttttt 

ccccggacac 

cagagtggat 

cccaggaaag 

accaacccag 

atccggggaa 

acccccagct 

gggcttgagc 

actgtccact 

gcaggcggcc 

gggtaagtgg 

gaccataggg 

ctgaggccgg 

tatggccctc 

cagctctaaa 

agtcagtgcc 

tgccagccag 

acctcaggcc 

atgttggcca 

attacaggcg 

gctggggacc 

tgacgcctgg 

ataaaacagt 

cctaattctt 

caatggacag 

ggctccagcc 

tctgtggaga 

tcctccatag 

ggccggccag 

gtggatcacg 

tctactaaaa 

tcgggaggct 

gatcgcgcca 

aaaaaaataa 

cgcgtgcctg 

cttttgtcac 



cactggtcct 
gtctctaagg 
gggaacgagc 
tcggtgtcca 
gcgaaagaag 
ctgagccacc 
actcagaagg 
cacaacaaac 
ctccatccca 
gtcacaacct 
gtccagtggc 
gatagctgtg 
agctgagatc 
ctgctcccag 
cacctcctgg 
gcgcagatcc 
agattatttt 
tggggcaaat 
tcacaaagca 
ccactgtcac 
caggagcagc 
acaggaaagt 
ggcctcagga 
cccctcctgt 
catggctgca 
ggctggtctc 
tgagccaccg 
ctggtgaggg 
gcagccctct 
tggtgactta 
taaaagccag 
cagcggttac 
tgacgcatgg 
aggccttggg 
acctcagcga 
gcatggtggc 
agatggtcag 
atacaaaaaa 
gaggcaggat 
ttgcactcca 
aaaaaagaag 
ggtatagggc 
ccaccccgcc 



tcgacccgcc 
gaggtgggtc 
cacctacccc 
gggtctgaga 
ccacctgcag 
cctgccagag 
tcatcacagt 
aagataattt 
cctcatcacc 
ccctgcaccg 
ccccggaggc 
ttctcctggc 
cagggaaagc 
ctccagggcc 
gcctggggca 
caaggccata 
tgttgtttag 
gccgcaaccg 
caagggaagt 
ctgcacggtg 
tggcaggcac 
cgctgctctg 
agggacctgc 
ggccccccac 
tggccactcc 
gaactcctga 
cgccagccct 
aacacagcct 
catctttgcc 
aatgaaaaag 
cacagggtgc 
ttgtactgct 
ctgcgcccct 
ggccggccag 
gctctcggca 
tcatgcctgt 
gagatcgaga 
taagccgagt 
aatcgcttga 
gcctaggtga 
cagggccagc 
tcaggaccat 
tcagtggcag 



tcggatgaaa 
aggatgccgg 
tccactgaat 
tgtgttgaca 
ccccaggaac 
ccaggaggag 
taataccctc 
ttgttatttt 
cagccgcagg 
aaccaagtgc 
tcccaccgca 
ttcttttcac 
tgagaatcgg 
tccttctcag 
cgcaggatga 
aacagcctgg 
caattgatta 
cagcatttga 
ggctgggccc 
cctgggacct 
cttggcctgc 
cctggggtca 
aggcgggtcc 
tgcctgcctc 
ccaggcaggc 
cctcaggtga 
ccctgtggta 
cacaagtgaa 
taaaactgaa 
aaggccacac 
ctttctgaac 
catgacaccc 
ccgcaaggcc 
gacgccaggc 
ccatgtgcct 
aatcccagca 
ccatcctggc 
gtggtggcgg 
gctcagcagg 
cagagagaga 
cacggacgac 
gaccgctgca 
tagccaaaat 



actgaatggg 
ggacagggtc 
tgccctgggg 
ggaagaatga 
tatctggcca 
accctgccaa 
catgccccaa 
ggcaaaagga 
gccccggcca 
agctcccagg 
ggcctcccac 
acgggtagca 
cctctgctgc 
gtgcccttac 
acggggtcac 
ccacagtggc 
agtttctccg 
cccgggatcc 
gatgctggct 
tccagcaagc 
agtcaggggc 
gggcagccag 
cttcctctcc 
tgcccccatg 
agtggggatg 
ggagttccta 
ctaaacactc 
gtgtggtttt 
gaatttaggg 
tccccccttt 
ccaggcacac 
tgtctgtggc 
accccggtat 
ccagatccca 
caggcccatt 
ctttgggagg 
taacacggtg 
gtgcctatag 
cagaggttgc 
ctctgtctca 
ccctcacaca 
gtggccccca 
aacggattag 



tttagcctta 
ctcttcctgg 
tgtgggtacc 
aaggggatgg 
gcacaccgtc 
tgggtcacca 
tgtgggaaaa 
ggcagggcag 
tccctgcaga 
ccacaggcca 
cacagccggc 
gaaagctgag 
ccggacgccc 
aggaggcaga 
ggtgcaggcc 
ttcccagctg 
ctgcccccag 
tgtgccaagt 
ctgctggaac 
acagagaagc 
tctgtctgct 
agagtgacca 
catccctcgg 
ccccaccaca 
ggatttcacc 
aagtgctggg 
acaccccctt 
gttgagcaaa 
gcgtggatgt 
aggcaggcgg 
agtaggtgtt 
ctctgcagct 
acatggaaac 
tctgcgccct 
taagaagtag 
cccaaggtgg 
aaaccccatc 
tccaagctac 
agttagcgga 
attaaaaaaa 
gctcccagga 
agaaacgtta 
aatggaacca 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 
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tgtgacaatg ccactgcccc aactgacaga agatggctat cagcagttca cgcggcccca 2 64 0 

cctatcacaa gtgcagggca ctctacaact tatgcatcct tccccagaca ccgtcctttc 2700 

gaccctccca ggtcagcaag gcacacaggg cctacatttc acagccacac agcagagggc 2760 

tgaggctgga actcggatgc tctgatttcc gttcaatcac atccccagag gtggcacaga 282 0 

gacggggggc ttctcttgac aaagtcaaga aagtcactgc cagctccact gaagaccaaa 2 8 80 

gaacctcagc tctcaaaccc tcttgaaggt gttaccgaac tctcccagcc tgtttcctgg 2940 

gtcccgatgt tggtcccgtg ggacacagga agaggaagaa gctccctaga gcagagcctg 3000 

gtgcacctgc cacactctca gagggctgcg cacgggcgga ggagccgtgt gcaggagtgg 3 060 

ggtctggatg gaggggcgct gtggccgggg gcagggggca ggggaagggt gctccaggtg 3120 

gtgggcacag cacgagcagg ggcagggagg tccacactca gatgtgcaca gggagaaaca 318 0 

aatcgtgcat ttccattgga ataggcggta aaaggtagaa aaacagagtg ggggccagga 3240 

agggagtcgg agccttctag tgtctctctg caggtgagcg gcagcccgag gtgtcagctc 33 0 0 

agcagacttg gggtccaggg gccgtgtctt ctatcactga ccccagggca cacggaactg 3360 

999 a gggaga gcagaggcac agggcacggt cagtgaaacg aaacaaggag tcatcaccaa 342 0 

atgcggaaag ggcaaggagt gcccgcagcc gcacaagggt tctgtctggg caacgtgggc 34 80 

gtcccaccag gccccgcacc ctgcaagcgc aaagctcgcc actgaagata aagggaagct 354 0 

gttggagctg cggagctggt ctggggtccg catggagctg ggcttatgct gcagtcacaa 3600 

gggggacatg gaagaggctg caggggacaa aaccagtgac cacagtctaa ctctgagcct 3660 

gtggaaaggc gcccacagca ttcacccatc ccagagatgc cattccccct gtgcccccgc 3720 

tccacggtga cagcgttctc caggaatatg atgcgcccct ctcctcttgc atcagccctg 3780 

acagtgagta ttcaggccaa aaagcagaag agcacagctg cgtggttcca tttccatgta 3 840 

gttctggaac aggcaacgct aatccaaggt gatagaagtc aggagagtgg tggagggggc 3900 

gggggttgag gatggcaaag gggcaccggg aactttccca gtggtagaaa tgttctctgt 3 960 

ctggaccgtg tggtagttat gcagacatat gcagctgtca aagttaatcc aaatgtacac 4020 

gttaaaatgt gtgcgtttta ttgcctgcaa gttatacctc aattaaaaaa ataaagttag 40 80 

cactcaggct tcttccacaa cttcctgaac cgtgtgagct gattttcttg ctattaaaaa 4140 

ttcacggtcc atggctgaga acagcagctg ccttctgttt gcaaagtcaa cgccaatcac 42 00 

tgcccggccg cggcagactc ggccccacag gacctccttt cttttttccc tttgacctac 4260 

ttccctgata agtgacaaga cagccagact ctgggaacaa acgcccgtta ttcggccccg 4320 

agctgagcgg gccctgcttc ctgagctaat ccgcccggac agacggaggg acgtgagggg 43 80 

ctttgccgtc ggctccagct gtcagtctgc ccgtcagact cgacagtggc cccctctgtt 4440 

cctcccgctg cccccactcc atccccgact tctttttgtt tcctgtccct gacagacgaa 4500 

catctgttaa aactctgtct gggtgagctg tggccagcgg cccacaaatc cccaagccgc 4560 

accccagcct catctgggcg ctgccgggag cactgcctgg ccaccctctg gacatagctc 462 0 

tgagagccac cggccagggc acgtgtggcc cgagtggcat ggtgcacgcc gctaagccca 4680 

ctgcccaaag gcccccaagc aggagggatg tgcaggagac aaaagtcaaa agaacagggg 4740 

cacgttccac agaggatggg gctggagggg tggcagtgag gaacagcagc ttccgaggat 48 00 

ggcggtggca actcccaaat aaggcctcac tcctgctgtt tttagctcat tccacataat 4860 

tggaaaaaca tggcagaaac cgaagccagc tgctgccttg gtcctggggc tgtgtggagg 4920 

gggtggggag gccggaggcc caggctctgc actcgactgc tggggatgag agtgactctg 4980 

agctgcagag agcagcatcg cagccgccat ggtcccattg agccccggcc acgctgggcg 5040 

gcagaggctc gtgggatata cctgccctgt ctcatggggg tcacttcagg aggggcgggg 5100 

gagccaggac acagcccagg gctagcggtc accctgcagc tcaggggcca cgtaaatagt 5160 

gccaccttga aggcacacag cagtgcgggg ccccccccgc caccaacgca tccctacctc 5220 

taggaggccg cctgtgtgcc cctgggaacg ctgctccctg tcccttgggg tcctggtgtg 52 80 

accaccctct cagccccttc cttggggaag gcacctgact ccctacaccc agctggcttt 5340 

catttgctca aaatcaggaa aaagcagaat tcaagacatc acagaaatgt cttcgcctgt 54 00 

aactccatga aagataaacg gtcagacacc caggagggag tcccagggac ccttgagtct 54 60 

cacctgaggc tctggcttca aacctcgaga tgtttccagc catgctagcg ccgcccccca 5520 

caacctgccc cacacagtcc tcccttggga actcacagat ttggccccca cctgccccgt 5580 

ttcttctggt ggagtgggtg cgttgggttg gggtggggct ggggactctg gatgtgtctt 5640 

aagagtctga gtgattctga cacagccagg ccctgccccc ctcctgacct tcgccccaca 57 00 

ggaaagggag ccacacgcct gaagcgccca gcacaccccc ctccgtcc.tc cccaggtcac 57 60 

ccgctggccg tgtgagccgt gctccccact gccccttcac ccaccccagc tcctcctggc 5820 
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agcacccagc 
gcactgaccc 
caaaccccat 
cagagaggtt 
ccgcaagtgg 
ccacggtccc 
gacactgcac 
ccacggtccc 
gacactgcac 
ccacggtccc 
cctgacactg 
tcatcaccac 
ctgcctgaca 
tcaacaccac 
cctcttggcc 
cacagcctcc 
acagagcctc 
atccctgtgt 
acctcggaag 
tctgacatcc 
ccgtgctgtc 
gagcagctcc 
gctaattcat 
gccacagagg 
aaacattttc 
ttggatacta 
ccattctggc 
ctgtcctcca 
ctccctctgg 
ccatcctggt 
taaaacctct 
agggcccagg 
aacgggaagc 
ttacaaaaaa 
atctttctgt 
tctggaacca 
gtggacgttc 
ttctcctgcg 

agggcaaaaa 

ggagcgtgga 
ccaactggaa 
tgtcagctcc 
ctgcaggggc 
ctaatcacca 
actccccagg 
cgcaatgccc 
ctgttagaac 
cctcttctgg 
ccccaggtat 
cccagctctc 
atgaaggtga 
tgccaggcac 
ggtagcctgg 
gccatgagtc 



cttggaagct 
agacagcctg 
ggagcgggga 
aagtcacttt 
aactaggtgc 
gggtccgcct 
ctcctcgccg 
gggtccgcct 
ctcctcgccg 
gggtctgcct 
cacctcctca 
agtcccgggt 
ctgcacctcc 
tccttggccg 
taaacatcac 
ttcgttacct 
tatccttcac 
gaccccaggg 
gctctctggg 
cggctggtga 
agctgcaaca 
aggtagtttt 
gcccgccggg 
aaccgagaac 
ctttaacgtg 
aagccccagg 
cccgctcaca 
accagaacat 
attaagaaat 
gacaaaccct 
agaaaactct 
gcagggagac 
gggtggggcc 
aagtgaactg 
ctcattgagg 
aagcagaaag 
ccggcccaga 
tgccagtcat 
cagttcccac 
gctgcgtgct 
gggcagggct 
atgccacaga 
ctcctcctct 
cagggcctgt 
gctgctctgg 
ttgaagggct 
cctgggagcc 
ttgggaaaga 
gggaggattc 
acgggcctgg 
taagggcatg 
ccagcacggc 
gcaagttatt 
tgggatttta 



acttctgatt 
caccatcacg 
gtataatcac 
cctaagctcc 
tccaagtccc 
gacactgcac 
ccacggtccc 
gacactgcac 
ccacggtccc 
gacactgcac 
ccaccaccac 
ctgcctgaca 
tcaccgccac 
gctcccaact 
tcacaatgcc 
gcgtttccgc 
ccccggacac 
cctcctctgc 
cactggcagc 
ggtgccctgc 
agcgacagaa 
cagtcgaggc 
cgcacggccg 
agcgactgtg 
atattttacg 
ctgccgcgtg 
ggtccggctc 
gacttcctta 
atgggaagat 
tgaaatgctt 
atttccacac 
ggaggagcca 
catctccagg 
aaacgctcac 
agctttttcc 
attcaggctc 
gtctcgccca 
gctccaactg 
aagactagag 
cagccctgcg 
cgctgctagt 
gcccagggct 
gggtagccta 
ccccccctct 
gcctggacat 
cagccacgag 
agcaaagagc 
aatcaacgcc 
tgggacgatg 
caccagccac 
gacagtggac 
tccgtcaccc 
tgggggtctc 
tcagggccca 



acaaccgccg 
ctgctcagca 
cccctttcta 
caacacgatg 
cggtctgcct 
ctcctcgccg 
gggtccgcct 
ctcctcgccg 
gggtccgcct 
ctcctcaaca 
agtcccgggt 
ctgcatttcc 
ggtcccgggt 
acaaaccaag 
tccctcggga 
tagcccaggg 
tggacctcac 
ccgactctga 
cctccatggg 
acgaggcctc 
tttcacgttt 
gaggcgtccc 
caataccaat 
ctcaggtgac 
gcaaggaatg 
gtctgctttg 
tgactcacca 
aggacaaagc 
cttctagaac 
ctaagaagag 
cagctgcccc 
gcatccacac 
cagctctgag 
gtcctcatgc 
tccctttgac 
agagtaaaac 
agcagggcct 
agttatgtgt 
aaaggtgacc 
gccctgacgg 
ccagcggtcc 

ggggccagag 

accaccccct 
cccccgcttg 
ggagactggg 
cagccagtcc 
caggggctcc 
cctttactgg 
caggcaaacc 
agcacctaag 
atggcagctg 
ctggatgagc 
caagcttgtc 
cacccgttcc 



aaggaagact 
caacccacac 
ccaacggaca 
acaaaaaata 
gacactgcac 
ccacggtccc 
gacactgcac 
ccacggtccc 
gacactgcac 
ccaccacggt 
ctgcctgaca 
tcatcaccac 
ctgcctgaca 
ccatgtcttc 
acaggcacaa 
ccagctccag 
caacccatag 
atttcactgc 
caccgctcct 
tgcccactgg 
tcttcacgtt 
gtcagcagcc 
gggcacctgc 
aggactgtgg 
aaacctggag 
tgaagtctga 
gcttcaatgc 
cgtttctcgc 
cacctcaaat 
tttaggtttc 
tggaacactt 
cgagcaccag 
gtcagactgg 
aaaaccagac 
agaacaccct 
agtccccaca 
ataaatgaca 
aaaagtgcct 
cctgacggct 
ctctggaatg 
aaccccacag 
ccaccaggcc 
gtgagcgcag 
caggaaaatg 
aattacattt 
ccagggctca 
acctaagtct 
ctcccactga 
tggaccctga 
gcgccggtca 
gacactgggc 
agtggccctt 
cagctgtgcg 
tggaactctg 



cgctccctcg 
agccttcctc 
aactgaagca 
gaaggtcagc 
ctcctcgccg 
gggtccgcct 
ctcctcgccg 
gggtccgcct 
ctcctcgccg 
cccgggtctg 
ctgcatttcc 
ggtcccgggt 
ctgcactttc 
catcctgaat 
gtcccaccag 
agccctcacc 
cctggaggag 
ccaacgtgac 
tctggccagc 
gacctcacag 
gcccctgggt 
aggcggcaca 
agcctggaaa 
tcttttaaca 
ggcaggacat 
agcccgcgcc 
taggccgtgc 
ccatccccat 
ttgcagagag 
ttctcaactc 
cagcttcaaa 
cctgttaatt 
ggaaccatgc 
tcccagttgc 
acacacggca 
ctggctgcat 
caaaatgttt 
ctcacggctg 
gagtctctag 
gaaaagctat 
gtgtctgtgg 
ccctgccagc 
gcagcctcct 
agccctgagg 
gcagaaggag 
gaaggcccag 
atagcccctg 
cagcccactc 
gtgaacctgc 
tggtgacaac 
acccactgga 
tgcaagccag 
acttcactga 
atacgtgagg 
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gagccacaca 
cccaaatagc 
agagacaggg 
ttccactgtt 
gtgtccattc 
aacagagatg 
agcagatgca 
gccctgctcc 
aacccacagc 
gcgggaggat 
agtggctcat 
ttaggagttc 
aaattagcca 
gagaattgct 
aacagagcaa 
atgtctttga 
aataaatagc 
aaaaacgcag 
ccaagagagt 
cccgttacag 
aaggtaggaa 
cccgtgaagt 
ccttctcgcc 
acctgccggc 
gttccgtacc 
tggcccccac 
cttctcagat 
tcagactagc 
tgttgcccag 
gctcaagcaa 
actgcccagc 
ggtcttgaac 
acaggcgtga 
gtgattctct 
gagcccgtat 
cagggtccag 
aacaaactct 
tgaggcttgt 
ctgcgaggga 
ctcatggctc 
aaccccaggc 
tcaaaccact 
cctggtgtcg 
tctctctcaa 
tgatggagga 
ctcacttgtg 
gcccgggatt 
caagtgttcg 
agcagatgaa 
tttctcaaat 
aaccatcccc 
cattaaagat 
caaacgcagg 
cacacacaca 



gggaccctta 
tgggattaca 
tttcaccatg 
agggcaaggc 
cccactcctt 
gtgacacctg 
acacagtgtt 
ctctaactcc 
agcaagactc 
gactaaaaga 
gcctgtgatc 
aagaccagcc 
gacatagtag 
tgaacctggg 
gactctgtct 
tacatattta 
aggaagcctt 
ggagggctgg 
aagagaccac 
aatttgcggg 
tttcaggtca 
cggtcccaac 
tttccctctt 
tcaggcggcc 
cgatccaggt 
cacacgtgtg 
attttctgga 
caggacgagc 
gctggagtat 
ttctcctgcc 
taatttttat 
tcctgacctc 
gccactgcgc 
ctcccctccg 
gtcctcactc 
ctccagcact 
gagaaggcca 
gcagccgagg 
gaccagggcc 
tgggagccag 
tggtgacttg 

cctccccttc 
ccaggtcact 
cattgccttt 
gcgggtgtcc 
ccgtctcccc 
ctgtctgcac 
acaaatgcct 
tgaatggggt 
tctccttctc 
tgcagcagcc 
gccctgctct 
acccctgcca 
ccccagccgc 



acaaaagctc 
ggcgcacgac 
ttggccaggc 
acctgacagg 
cgacaaatgt 
cttcctacat 
taggcagaca 
agtggccagg 
agaaatggca 
catgtcttga 
ctagaacttt 
tagccaacat 
cgggcgcctg 
aggtggaagc 
caaaaaaaaa 
cctcctgcaa 
acggttcgat 
ggaacaaaaa 
ctccagacct 
gaccgctgca 
acaacagagc 
accagcatgc 
ttagtccttc 
acacccccag 
ctgtcctatg 
cctcggggca 
atatttgtgg 
tgcatttttt 
agcggcatga 
tcagtctccc 
atttttagta 
aaatgatcca 
ccggcccgag 
tcccccggcc 
tttccctttc 
gagttcagcc 
aggttcccgg 
caccccctcc 
cttgctccac 
accccacctg 
ccctggcctc 
aagggtctat 
caggaggcag 
gaggccgagg 
aagacactca 
cacccaccca 
cgcggggtcc 
gctgaatgag 
gctgtccaga 
ccgacttggg 
ggcacagcgg 
cagaaagaga 
gggagcagag 
tgcacacaaa 



ccagggcaac 
taccgcccgg 
tggtcttgaa 
cacgactgca 
ccacacccag 
gcccattgct 
tcaccaatcg 
ccccaagcca 
aaaacacaaa 
taagatattg 
aggaggctga 
ggtgaaaccc 
taatcccagc 
tgctgtgagc 
aaaaaaaaaa 
tcgcaaatgc 
cacccacaca 
aacagaagat 
cccttcagct 
aaatggaagt 
tcacctcata 
tcctgcctca 
agatctcagg 
tacaggagtc 
tccatctgtg 
ggggaacagg 
atattgggca 
tttttttttt 
tcttggctca 
aagtagctgg 
gagatggagt 
cctgccttgg 
ctgcctgttt 
ctgactgtgg 
cgccaggact 
tcttctcacc 
gcagccagca 
tccagggagc 
cagggcccca 
ctagaaccta 
ctctgtaaac 
aatcctccct 
ctcatctgga 
tgaacggtca 
ccctttctaa 
gccccagagc 
ccagtgcctc 
caaatggatg 
gccgtgagga 
aaacaagatg 
acagaccctt 
aagacaaata 
aaaagaccca 
cacagaccct 



atgttctctt 
ctaatttttg 
cccctgacct 
cgatctgctt 
ccttgctttg 
ctcccaaggc 

atggtggcaa 

gctctcacct 
gagaacagaa 
ttcaggcata 
ggtaggtgga 
catctctact 
tgcttgggag 
cactgtactc 
gatatccttc 
ttctgcagtg 
gacacacagt 
aaaatgtgga 
tctcaaacac 
gcagacagcc 
tgactacaca 
aagccgctgc 
cctcctgaga 
tccggctcag 
tgccggcttg 
cccgtctcat 
acatatatgc 
tttgagacag 
gtgcaacctc 
gattacaggc 
ttcaccatgt 
actcccaaat 
tacacctttg 
tggccactcc 
tcaaccaaca 
aacagacagg 
agccaagcat 
aggcagcgtc 
ggtatggggg 
ctatgccacc 
aaagggctca 
taacctgctt 
ctccttccct 
acagcgaagg 
tgcactgact 
ccagagtgcg 
ggagcaatgc 
gatgaacgaa 
ctaggccgcc 
cttggtcggg 
tgatgtaaca 
caaacctgga 
cacgccacgg 
agccagcaag 



gcctcagtct 
tatttttagt 
caaatgatcc 
gttgggggct 
acaccccaag 
agacatcccc 
cagacaccag 
gcccactccc 
acgccccata 
ggccaggcac 
tcacctgagg 
aaacatacaa 
gctgaggcag 
caacctggac 
actaaaactc 
cataaagtga 
cacatacagg 
gacagacaca 
acgagccggg 
ccttactcaa 
ggtcacacag 
acgtgctgtt 
gagacctctg 
cccctgctgt 
cttcctgaca 
taactgcttt 
tccacctttt 
ggtctcactc 
cgcctcctag 
ccgtgccact 
tggccaggct 
tgttgggatt 
ccatattccg 
ctgccgtcat 
ctgcagagcg 
caggaaagaa 
ccttctccgc 
ctggggcagt 
cagcagcaaa 
tgctgtgggc 
tccaacctgg 
ggtccaaacc 
gggtccagtt 
gccccagagg 
ccctcgtgga 
agcgccagag 
cagcacccgg 
tgaatgagca 
caagtcccca 
gaggctctcc 
gccatgtctt 
aaatcctcac 
gcgccacgac 
aacaggggga 
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ccaggaaact 
cacttctcca 
gcccttagga 
agtgggacac 
catcacgcta 
gttaccccct 
gcccgggagc 
cgccccgaac 
atgcctcact 
aaaaaagtgg 
ggctcaagca 
cacacccgcc 
ggcggccgca 
cgcctcacct 
gcatgtgtag 
ctcctcacag 
ggaccgtcac 
tgtgtccctc 
ttccacctgc 
gcaatttccc 
cccatggagt 
cctccccagc 
ccaatggccg 
ctcctgggca 
gaacactgtc 
caacagcaat 
gctgggtgta 
cctctccctc 
caggaaacag 
cctaggcccc 
ccatccctgg 
gaagctggtg 
agaactgctg 
tggtggctca 
cccacgagtt 
catacacaca 
tatatatata 
acttcttctg 
gcctgaaata 
cagccacaca 
atgggcttct 
acgcccccgc 
ctctcccccg 
gatgtccatg 
gttgcaggca 
cgtcagggcg 
gcggggtgga 
ttcttttttt 
taatctcagc 
cccgagtagc 
tagagacagg 
cacccacctc 
ccatttgctt 
gtacatttct 



gttcctaaag 
tccaccggat 
ctgttcctga 
cggcacagga 
cagaaacggg 
cgcttgccag 
atccccgaag 
cgacacctgg 
gcaacacagt 
aaaatatctg 
atcctcccaa 
aagtggatca 
acccattccc 
gccttacacg 
aaaggctcgc 
cgagtgtgga 
ttgtttctga 
tgcccaaaca 
ctcatcgctg 
caaaaccagc 
gacatcctcc 
tcctcggagc 
gggtcaggac 
aaaggcttgt 
cttcttgctg 
tgaacggaac 
gcagccccaa 
cctcatgctg 
ctccagagag 
acatttcaat 
tccctgtggg 
catttcccca 
attctgggca 
tacctgtgac 
tgagacaagc 
cacacacaca 
cacacatacg 
ggccttgaaa 
gcccccatga 
cctccctcca 
gcagggaggg 
atcctcccac 
tgccccaacc 
ggtgagtata 
tggatggagc 
aaggggtgcg 
ggagccatca 
ttttttgaga 
tcactgcaac 
tgggatcact 
gtttcatcat 
agcctctcaa 
tcaaccagac 
cccggcgtgc 



tcaggacccc 
gccaggagag 
gtgtggaagc 
agcaggccct 
gaggactggt 
gccccagggt 
gcgagctgca 
aaagaaagat 
cccaggaagc 
gggcaaaaat 
ctcagcttcc 
tttcgaacgg 
gccaggcccc 
tcctgccgfct 
ttggggacag 
ctgagaaaac 
atgagtcaca 
tcggcctcaa 
gccccgtcca 
cgtgaccttc 
ccatctgctt 
ccccaaagga 
ctgcctgtgt 
cttgtctttg 
ggatggccaa 
tcgataaatg 
aatacggata 
ccccaagtgt 
ccccaccctc 
tcaaagtcca 
agtgggccac 
tcagggacct 
gattattcat 
cccagcactt 
ctgggcaacg 
cacacacaca 
tgtatgtgta 
acgaggcaac 
gctctgcaga 
ttgcagcagg 
gtcacctgcc 
tcccctgccc 
ggcactcaca 
gggcactcag 
gggtctgcca 
tcaggctgcc 
ggagggtccc 
tggagtctcg 
ctccgcctcc 
gatgcccacc 
gttggcaagg 
agcgctggga 
aagtgaggcc 
acagggcacc 



catgtgctca 
tccttttagg 
cagcccactt 
gtcccccacc 
cccagggatg 
cagccccaga 
tcctgaacgt 
cctcagccgg 
accaagtgcc 
aaaacaaaac 
cgagtagctg 
gtttgccgag 
gccccgcccg 
gtcctgcagc 
caggcacagg 
caggacagac 
tcctgcctcg 
gtggcatcag 
catggggccc 
ctggccaccc 
cctcccacca 
cacccctctg 
ggtagtgacg 
tgctatgtgt 
gcttgtcact 
cttccagcac 
taactggaac 
agatgggttt 
ctgtgtcctg 
aaccttccat 
tgtccccaga 
ctgtcacaac 
tgataaatac 
tgggaagtca 
tggcaaaatc 
cacatatata 
tataaataca 
cttccttgga 

ggggtcctct 

taccccttta 
cccccacccc 
cgcgcccccg 
aaaaggctgc 
gatctccttc 
gtctgtccag 
ctccaccacc 
ccgacagtca 
gtctgtcgcc 
cgggttcaag 
actacgccca 
ctggtctcga 
ttacaggcgt 
aggtcaagag 
tcccaaacac 



gacagcagtg 
gggccccaca 
ggaagccccc 
actttctgca 
gcgctttcct 
ggccagactg 
gtgtgatttc 
tgccccagag 
tgaggaccaa 
aaaacaggat 
ggaccacaga 
gttccttctg 
cccgccccgt 
tgcacacccg 
tgggagcagc 
tgagagaagg 
tcccccgtga 
ggacctcccc 
tcagcctggc 
tcacacccag 
agctcctatg 
caaaggctgc 
ggaaccccag 
ggacccagca 
ctcccaagcc 
ctcattcaaa 
aacaaactca 
tgtgaccacg 
ctctgggaac 
aatggcctgg 
gccgcagccc 
ccagcgtggc 
gcgacttgca 
aggtgtgagg 
tctcatctct 
tgtatatata 
tatacacaca 
aatccccttg 
gcaggcccgt 
gagagggggc 
acccacgccc 
ctccccccag 
cgctcctggc 
ctcttccccc 
tacagagtgt 
ttctgcctgc 
ttgctgctga 
caggctggag 
caattatcct 
gatgattttt 
actcctgacc 
gcgccaccat 
ccccaggagc 
agcctgtgat 



agagcaagga 
ccgagactct 
tgccctcccg 
agctgggccc 
gacacctctc 
gctatcccag 
ccgaagggcc 
gagaagagcc 
ggcggagagt 
tgacctcctg 
cttgaatcac 
gggcaccccc 
cccgtcccac 

tggggcaggc 

cgccattgtc 
ttccagaaga 
cagcctccag 
gcgggcacca 
cagacggcct 
atgtgacctg 
actagaacac 
cccccacgct 
agacaatggg 
gcttccatag 
ctcctatgac 
ccaggggaaa 
tcaaaatgaa 
actttctcac 
agctggcacc 
ccagaaatct 
cactgtcaca 
ccccaggctg 
gggccaagca 
atcactggag 
attaaaaata 
aataaccata 
cacacagaca 
ccactgctga 
gtcccccagc 
cccccagagc 
gcgcaccccc 
ccccctcacc 
tcagcacctg 
cagtgcgctt 
ccccggagag 
agtcagggaa 
cccaattaat 
tgcagtgatg 
gcctcagcct 
gtatttttag 
tcaggtgatc 
gccaggcttc 

tggcgccctc 

ggtgacacac 
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gggctccccc aggtcaagtg gcaaagtctc ccccagggaa gaaaggagga agccatgcct 15600 

ggcaaaaagc acacctctcc tgcccaacgc tttaacctct gtatacaaat caggccatgt 15660 

gcactcgctc cttcttacaa tgctcataat ttatactttc agagtaaatg aaacttggca 15720 

tcaacccgag aaacagctat tcttttctag atgcttacag tgcccagcaa atgaggactc 15780 

gggtgtaatg agattatgga cactggaaac aggatcataa tgtgacgtgg tcggtaatgt 15840 

gcagttttat ttgcttaatg accctcgccc cgtgacaggc tccctgaggg tgggcctggg 15900 

ggcagaggtc cccgccacgt ccccagccct cagcacagtt gccaggagag ggtgacactc 15 960 

atgaagtggc acagggaaga tgggagctgt gggctctgca gatccaccac ctcttctgtt 16020 

catttttgtt gatgctgttt tttaagaaaa ttattgaagt aaaattcaca ggacatacgt 16080 

ttactttttt tttttttttt ggagatgggg tctcactctg tcacccaggt tggagtgcag 16140 

tggtgtgatc tcagctcact gcaacctctg cctcccaggt tcaagcgatt ctcccacctc 162 00 

cgcctccaga gtagctggga ccacaggcgt gcaccaccac acccagctaa tttttggggg 16260 

gtatcttttt ggtagagaca gggtttcgcc atgttgccca aggctggtct tgaagccctg 16320 

agctcaggcg atccacccgc cttggcctct caaagtgctg ggattacagg cataagccac 16380 

tgcacccagc ctaaatttac cactttaaag tgaatagtgt tacctagtgc attcgcaagg 16440 

cggtgcagcc tccacttctg tctagttcca aagcacttcc attgccccac aggcaaaccc 16500 

cacacccggc agcagtcatg ccccagtccc cgcccccagc cccggcaaac acttttgatg 16560 

gacttaacta cacacattct caacatctca tataaacgga atcacaatat acagcctctg 16620 

atgtctgtct tctttgactt ggcaccatgt tttcgaggtt catccaggct gtagcatgtc 16680 

agtgcttcat cccgttttag gggtgaacca tattccagtg tgcagacaga aaccaatctg 1674 0 

tgcatccatt cacccactgg gggacctttg tgtcatttcc accctcggct gttgtgcaca 16800 

gtgctgctac ggacattact gtccattcac attttgtgtg aagacctgtt ttcgattctt 16860 

aagagtatac agctaggagc ggaattgctg ggtcatacgt aaatcaatgt ttacgtctca 1692 0 

aggaatcaac aaactgtttt ccacaatgtt gtcttttttg tttgttttct gagacagggt 16 980 

cttgctctgt cacccaggct ggagtgcggt ggtgtgatca tggctcactg cagcctcaat 17040 

ctcctaagct caatccatcc tcctgcctca gcctcctgag tagctgggaa cacaggtatg 17100 

taccaccatg gccagctaat tttctaattt tatttttttt tgtttttgtt tttttgagac 17160 

agagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctcagct cactgcaagc 1722 0 

tctgcctccc gggttcacac cattctcctg cctcagcctc ccgagtggct gggactatag 17280 

tcaccggcca ccacgcctgg ctaatttttt tgtattttta gtagagatgg ggtttcaccg 1734 0 

tgttacccag gatggtctcg atctcctaac ttcatgatcc acctgccttg gcctcccaaa 17400 

gttctgggat tacaggcgtg agccaccacg cccgacctta cttttaattt tttaatttta 17460 

ttattttatt ttattttttt tttttttgag acagagtctc gctctgtagc ccaggctgga 1752 0 

gtgcagtggc gggatctcag ctcactgcaa gctccacctc ccaggttcac gccattctcc 1758 0 

tgcctcagcc tcccgagtag ctgggactac aggtgcccac cacgatgccc ggctaatttt 17640 

ttgtattttt agtagagaca gggtttcact gtgttagcca ggatgatctc aatctcctga 17700 

cctcgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 17760 

gcccagcctt tttttttttt tttttttttt ttttgagata gagtcttgct ctgtcgccca 17820 

ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 17880 

attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 1794 0 

taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 18000 

ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 18060 

ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 1812 0 

cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 18180 

gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 1824 0 

gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 18300 

atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 18360 

agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 1842 0 

ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 1848 0 

ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 1854 0 

taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 1860 0 

aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 18660 

gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 1872 0 

cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 18780 
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ttcaagacca gcctggccaa catggtgaaa ccccgtccct actaaaaaaa tacaacaatt 18840 

tgccaggcgt ggtggcgggc acctgtaatc ccagctactt gggaggctga ggcaggagaa 18900 

tagttagaac ttgggaggtg gaggttgtag ttagccaaga tggtgccgct gcactccagc 18960 

cagggggaca gagcaagact ccatcccaaa aaaaaaaaaa acaaacaaac aaacaaaaaa 1902 0 

agaggtcaag gctgcagtga accatgattg tgccaatgca ctccagcctg ggtgacaaag 19080 

tgagaccctg cctcaaaaca ataaaaatat aaataaaaat aaaacataat agcaaacgtt 19140 

tcatagaggt ggtatgagca ttaaatgaac tgataaacgt ccctggaaaa cagtaagtgc 19200 

tatggaagga ttcgctgccg ccaccgccac caccattagc atgtttcaac ctccatcacc 19260 

ctcactgtcc cctgtcacca tcctttgacc agggcactcc cagctgcagc ctttctatcc 19320 

tcttgtccac ccttcataac tgtaagatca ctcagctccc aagaaccaca gtctacaggg 193 80 

taaccacatt tccaaatctc aaaccagacc cgctggtctg cacttccagg gacaacagga 19440 

tattttcaaa ccagcccaaa agagatgtgt ggctcagcat aagaggaaca ggagaaactg 1950 0 

aggcctcttg ccctgagaat gagcttggaa gtggatgtcc cggcctcact caaaccttca 19560 

gatgactgag gcccagccag gagcttgagt gtaccctcag gtcataccct gagccagaag 1962 0 

cacccagcta atccactcct catcactgac tccctcccca taaaaaacct gtttgctgtt 19680 

tcaggctgtt aagttgtggg ctgttttgtt acacagcaat ggataactaa cacacgaggc 19740 

ctggcaagtg tggagcaaag ctgcccaagc cctcaagtct gttcatgtgg gtgttggcct 19800 

gtgtttgcag aaatccagcc actgagtcct cccatgcagt cactactgcc ctctgcacag 19860 

acacctgcca catccctgcc tgggccagga gctccactag tgcaggaatg gggtctgccg 1992 0 

tcccaggagg atccctgaca cctagcacag ggctagcagc aggcagcact tggttagtga 19980 

ataaactgcc cttcacctgt acacagaagg gatgtttcta taaggggtaa ttaagtacag 2004 0 

agctgggaag ctatgctgac cagaaggctc taaaagcaat taaccaacga ggggaaaacc 20100 

cttcctactc attctcggcc cattttattg agcactgacc atgtggaagg ccccctggtg 20160 

agactgggga atgcaccaat aactgagaca gcttccggct gttgccctca ggatgcctga 20220 

gctgggatag ggccagggtg ggggtggtgc gtgtgacagg gttactgttc acaaccctgc 202 80 

cgggccataa gccctcccca acaattccaa aatccaaaac gctctgaaga tggaaagctt 20340 

ttgttgctca tctggtgaca aaacctcatt tggtgcatgg gccgggtgcg gtggctcacg 20400 

cctgtaatcc cagcactctg ggagccgagg ggaaggatcc cttgagctta ggagtttgag 20460 

accagcctga gcaacatgtg agaccccgtc tctaccaaaa atacaaaaat tagccaggtg 20520 

tggtggcgca ctcctgtagt cccagctact cgggaggctg aggcgggagg atcgcttgag 20580 

cctgggaggt gggggctgca gtgagctgag attatgacat tgcactccag cctgggtgaa 2 0640 

agagtgagac tctgtctcaa aaaaacaaag ttaaaaaaaa aaaaactgtg catgggtgtg 2 0700 

ggctacagat agtcttttct gccctactta gaatgaacgt gccacatttg ctatagaaat 2 0760 

attcaagggc tggtggcaaa tgccacacag accctgacgc tgttccaagt tctgagaagt 2 082 0 

cctgcattcc tcagggcccc agagtttcag agaagagtct gtaggcctga gttaagaagg 2 08 80 

aacgccttca aaagccctgg ggacaaaggg gaaaggggtg ccccaggact gcgtgggtac 2 0 940 

ctaccggaac gagccgtcca ggttggcacg gtggatgaag ctgagcttgg cgtcagccca 21000 

gtagagcttc tgctcctcca ggtcgatggt cagtccattg ggccagtaaa tgtccgagtc 21060 

cacaatgatc ttccgggtgc tgccatccat ccctgcccgc tcaatccggg gcgtctcacc 2112 0 

ccagtctgtc cagtacatgt acctgtgacg ggggcagggc aagagaagca gctaacacag 2118 0 

atctgttttt tgttfcttgtc tgcatagatg cagacatgaa acaacagaca gtgaacttgc 2124 0 

cctaaaatct cacccatcgg aaataaccaa caggtatggt ttcaggtatt cctgccttaa 21300 

gctgggcaat caaaatatac tatttccaac ttgttctcag ttaacagtaa attctgggca 21360 

ccttcccttc ttgtggatag aaagattcct tgttcttttg atgattgcct agtgtactct 2142 0 

gctgtaagtt ttttaaagaa cttcaggtta tttctgattt ttttgctacc atgaaaatgc 21480 

tgtaaatgaa cctctaaaag gcaattcaaa acactcagga tggaatatta tttagtggta 2154 0 

taaagaaatg agctatcggc tgggcccagt ggctcacacc tctaatccca gcactttggg 21600 

aggccaaggc gggtggatca cgaggtcggg agatcaagac catcctggct aacacagtga 21660 

aaccccgtcc ctactaaaaa tacaaaacat tagccaggcg tggtagtgag cacctgtagt 2172 0 

cccagctact taggaggctg aggcaggaga atcatttgaa cccgggaggg ggaggttgca 217 80 

gtgagcagaa atcgcaccat tgcactccat cctgggcgac agagcgagac tccatctcaa 21840 

aaaaaaaaaa aagaaaagaa aagaaatgat ctatcaagcc atgaaaagac atggaggaaa 21900 

cttaaatgca tgttagtagg tgaaagagcc aatctgtatg agtccagttc taaacactct 21960 

ggaaaaagca aatacacaga gacagtaaag catcagtggt tgccaggagt tggagaggag 22020 
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agggatgaat gagtggagca cagaaaatca gggcagtgga actatcctgt atgacatgga 22 080 

atggtgggtg catgtcctta ctcatctgtc taaaccaaga atgtacaaat caagggcgaa 2214 0 

ccctcgtgta aacgtggatt ttgggtgatg gtgcgtcagc cagctttcat cagttgtaac 22200 

aaatgtacca ccctgcacag gatgctgaca gttgggaagg ctgtgtgggt gtgaggacag 22260 

ggatgtatag gaactcagta cctgctgctc atcaattttg ctgtgaacct acaactgttt 2232 0 

gaaaaaatta agtctattta aaaacaacaa aacatggcca ggcacgatgg cttgcacctg 223 80 

taattccagt acttcgggag gctgaggtgg gtgggtcact tgagccaccc tgggcaacat 22440 

ggcaaaatcc cacctctaca aaaaataaaa attaaaaaaa agttagctgg gcatggtggc 22500 

acactcttgt agtcccagct acttgggagg ctgacgtggg aggatccctt cagccctggg 22560 

aggtcgaggc tgcagtgagc tgtgactgta ccactgcact ccagcctgga tgacagagtg 22 620 

agaccctgcc taaaaaaaaa aaaaaaaagg ctgggtgcgg tggctcatgc ctgtaattcc 22680 

agcgctttgg gaggccgaga tgggcggatc acgaggtcag gagatcgaga ccatcctggc 22740 

taacacggtg aaaccccgtc tctactaaaa gtacaaaaaa aaaaattagc cgggcatggt 228 00 

ggcggacacc tgtagtcaca gctactcggg aggctgaggc aggagaatgg cgtgaacccg 22860 

ggaggcggag cttgcagtga gccaagatca caccactgca ctctcagcct gggagacagc 22 92 0 

aacactccgt ctcaaaaaaa aaagaataaa acccatggct gggatggacc ctgaacctgc 22980 

agctgcagct gttcctgggt aggtctgtgg gcgacgtggc tttgcttctc catgttccca 23040 

agagacaagc atcacccatc catgagaaac aagcacatcc tcagggcgcc cttacgtgat 23100 

ctctggccaa tgaaccaaga caaagtgagc agacaccagg tctgggatgg caggtcccac 23160 

ccccaccagt gcccagtgtg ccctgtttgg aggtgaccac agggtgtgtg cccagaggct 23220 

gggcgtgact ctcagcggag accagagggg aaccacacca gcttggagga ctcagttccc 23280 

atcccagcca gctgggatga gccacaggac acaagggctg gcagacctat tgtgttttgt 23340 

ccacccttca cagcagagaa aggggacagt gcccagaatg tcctctgagg agcctcctcc 23400 

cactcttggt ccttgtaaaa tggtgctgac tcccttgctc ccttcttcct ggggtgggcg 23460 

gcaaacccca ttcccctcag ccttagcaag tgatttagaa acaggcagct cgcccaagcc 23520 

aggcatgaga gtgatcccgg gacacaggga gaacaagccc cgctttgccc tctgggggtc 2358 0 

tccattcagc agaagaggca aatgacagac acacagccgc ctcctccccc accatggtgc 23640 

tctgcagcct caggagcctc aggtgcacca agggccaccc catccagggg gccatgcttc 23700 

cttgagtggt atcgttcctg agcgagtacc atctccacct tccagagggg ctgtgacaag 23760 

atcaacaaga atgagggcat aggagcctcg aaccaaacat gccctcttcc ctgcagaggc 2 3 820 

tgactgcgcc cagctgctat caccaagccc ctgctcctcc ggccccgtgg ggacagggta 23880 

agaggggtgt cacatggaac agctctccaa acagtccctc tcaagctgct gtctcctgtg 23 94 0 

catctagtga gaacccaacc aacaaaggga aggtgggaat tgctattccc attaggcaga 24000 

tgagaaaact gaggccccga aaggctggcc tgttccaggt tacaggcgct gagcggctgc 24060 

tctgggaaca cacttggtgt ctgctgaggg cccgagcccg gccatcatat gactcaccct 2412 0 

tcgccagcaa agcccgggtg tgggtgaact tttcctggca gcctgggact ccaaggtgct 24180 

ggcagccagc ccagggaagg ctcccgcgtg cctgcggcag acgccttgct ttacctgcac 2424 0 

gtccccaccc ctaggagcct ggacagagcc cagaccctcc gccacctcct gagaaggtat 24300 

caggggcatc agtctggact tgggggggaa tccacacagg ccttccccaa atgctccacc 24360 

gtggcccatg gaaaaggctg gaaaacgtgc aggagcagga gcctccgcat ggagcataat 2442 0 

tcacattcct tccccgagtt tcataacaga ggcctgctgg tttccttaaa tggggaattt 24480 

gcgagccagt cggtgaccag agactggttg gcgtggacgt gctcttgcag agtctcaaac 2454 0 

gctaccacaa gcccagccaa attccacgga ggaaaatcga cttccgaaga aaagagctgc 24600 

agcatggcct tcgtgcagag ccagctgcgg ttgtggttgt gtgttatttt agggaagggc 24660 

cattttgcat tttaaagagg gggttgggtt tcaccctggc tttaatttga gacccggggg 2472 0 

ccactgcagc cccttgtcag gctggtacag gccggggact cctcccatgc taagccagtg 24780 

tctttctggc cccagatcct caggggccag agggtcatcc ccagagcccg ctctgccacc 24840 

cacatgggta ccctgggcct gggagggatg tgccttccct caaccctgcc tggatgtccg 2490 0 

cacggggcca cctgcattgc tgaaactgca acgaagtcga gtctcaggag gggcccccct 24 960 

ggctgcaggg ctcttgatcc ttttggccac gtgcacactg aggtggacgc tcggacccag 25020 

agaccccctt catgatgatg gccggggcag gaaccccctc ctctgaggaa ggaccctggt 25080 

gggggacagc actgcaggag ggcacaggag atgacggggg ctctagcagg gccgggagga 2514 0 

aggccaagat gctcctcgca accgtgtgcc tgtggccagg acagaggaca aacccaccct 25200 

ccactgtccc cactctcagg acagcagtcc tgccccagga ctcagcgccc acacttatgc 25260 
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ctgaggacca 
acaggctgga 
agagtcagca 
gtgggagccg 
gtgtggggag 
ggagtgaaca 
gcatgtagag 
ctcgcagaga 
accaagggcc 
ctcccaatca 
cccaaccagg 
ttacctccca 
catctagcac 
gcagagcacc 
cctagcacct 
aagtaccaat 
ccaacaaatc 
caaccaacat 
tggacttctg 
ctcacaaact 
acaaggatac 
ctacatgcag 
cgccactcct 
cggtaataac 
ctccagccac 
acctcatgtg 
tcaaacaggc 
gaccgcgctg 
caaaagtcac 
acccaagcac 
tctcctccca 
accaggcgca 
cgacccacgc 
ccatggcagc 
gctatcttca 
ccagccgacc 
ggggagaggg 
gctggctccc 
aggggctcca 
tcaggtagtg 
gataaattcc 
gatgcagaca 
cacatgcgca 
gacacacaca 
cgctcacatc 
acaaacctgt 
tggcttccta 
cctgagctcc 
actaccctgg 
cccggctcct 
aagttggtgg 
ctgtcccaca 
caggcgggct 
tagaatatct 



ctattcaagt 
atcctctccc 
cctcagggta 
gcatctcact 
aggcccagga 
gccacaggtg 
gaaccatttc 
tgatgcaggc 
cagcatctcc 
aatacctccc 
caccaagtac 
accaagcgcc 
ctcccaactg 
tcccaaccaa 
ctcaactgga 
caccttccaa 
acctagcacc 
agcaaaagcc 
gtggcgagga 
gttcttggtg 
tgaggcacag 
agtgccagca 
tacccccaca 
cgtggagagg 
caggggatgc 
tcctctcctc 
cccgccccaa 
cccccagaca 
caagaggcct 
tgctttatcc 
ctcactgaaa 
agggctcaca 
tgggagggaa 
cgctggtgag 
gcctctccga 
cttccaaaaa 
ctgagagggc 
cggcagctgc 
gggatgagtg 
ggggcacgag 
accgtgtccc 
ctctcgcgca 
cactcataca 
ggctcacacg 
ctcccactcc 

tgggctctga 

agtcaccaca 
caaacaccag 
gcaggcccag 
gcagccggca 
ccccaggact 
gggcatggcc 
cctgtggcag 
aggtcaggct 



cagtatttgg 
tctccctctc 
ggaccatggc 
gttttcactc 
atcaacacta 
tgatcctggg 
aaatagttaa 
aaagccagct 
caaccaagca 
aaccaagcac 
ctcccaacca 
tagcacctcc 
atcacctccc 
gtgccaagca 
caccaacaac 
ccgagcacct 
tcccgactga 
ataaagaagt 
aagggcattt 
ttttcctcat 
agtggcaccg 
aatccagctg 
agtgaggtgt 
gctcctgccc 
cttccagcac 
acaccttggg 
accagcccag 
gccacacggt 
tcccagcacc 
ccctacgcac 
gggccagaga 
aatgcagtgg 
cgcaatgaca 
caaccgcagg 
ctactgcaga 
gaaggtcagt 
aacagttcac 
aggggtggct 
gttgcccagc 
gctgctctgg 
acccctgctg 
catactcgct 
cacacaaata 
tatgcacgca 
cacactcagt 
aaggctccag 
cagatcaaag 
ccttggtgag 
aggaggggca 
ccatgggagt 
agagcccagg 
aagggaaggg 
gcctggatgc 
gggtgcagtg 



cgagcagggg 
cctctccgga 
tggcgtcatc 
tctcattcaa 
aggtcaactt 
gagggcttct 
gaaaagccag 
cagatctgag 
ccaagtacct 
ctagcacctc 
agtgccaagc 
caactgagca 
aacctagcac 
cctcccaatc 
tcccaaccaa 
agcacctccc 
tcacctccca 
aaaaagacaa 
ttattataac 
gacatgcagc 
tgccaacctt 
tcttttctct 
ccccgctgct 
ccacgccacc 
gagtcagagc 
ccctgtttcc 
ggcctttgca 
tctcagcctc 
tgagctccga 
acgtcccttt 
ctgggctata 
atgcctggtt 
ggaggagcgc 
ccggccctgg 
tgccccctcc 
aaccccgcga 
caagcggaac 
agcccactcg 
acagcacccc 
cctgacctca 
gtaccccata 
cacacgggca 
tccactcaca 
tatgcgtgca 
tgctcagaca 
cccttcccat 
aggtgaactt 
gacagaccct 
tctgcaggat 
cagggggagg 
ggtcttcagt 
cttccagagc 
atggccgccc 
gctcacgcct 



ttgctgccgc 
gacatggagc 
agcatcactg 
atgactggag 
tgcccccagg 
gggagagaat 
agccaaacag 
catgggaaag 
cccaaccaaa 
tcaactggac 
acctcccaac 
tcatgcacct 
cgagcacctc 
aaatacctcc 
gcgccaagca 
aactgagcat 
acctagcact 
aaccacgtag 
gacagctaac 
atggtcacgc 
gtctcatctt 
tcagaacaga 
gctttctgtc 
caccccacag 
tggcacctcc 
ctacattctg 
ctggctgatc 
atctgcttcc 
cggaagcccc 
caaatactat 
cccgctgcgt 
gggaggtgag 
aggtcctggc 
gagagggctt 
tagccagaga 
ctcctggagc 
agaggctgcc 

gagggcagcg 
tgggaggccg 
gggactcaaa 
cttacacaca 
catacacgtg 
cgcatgcatg 
cacgcacaca 
cacacacgcc 
gctcgtcaga 
ggccacatgg 
caccccacac 
ctggcaacca 
tcactgcaaa 
cctactccag 
cctgacttca 
actcctggga 
gcaatcccaa 



gggcgctgtg 

ctacagggac 
gatctgatga 
caaagggaag 
ggcaggggtg 
tcagaggcaa 
ggacagttgg 
actactccca 
tgccaagcac 
accaactact 
caagtaccaa 
cccaacagag 
ccaaccaagt 
caaccaagca 
cctcctaaca 
catgcacctc 
gagcacttcc 
gcatggagac 
atttgttgaa 
ctctgtacag 
tttatcgaac 
tcccaaatct 
gccaggatcc 
ctcactctcg 
tctgctcgag 
ctacagcccc 
cctctgcctg 
agtctcgact 
tcgccacagc 
tcatttacca 
ggggagcagg 
ggagctgcag 
gacacgatgg 
ctagcaagct 
cactgctaca 
cacagtgcag 
ccggaggtca 
agggcatacg 

ggggcacttc 

atactttggc 
gactggttca 
cacacacagt 
cacacacacg 
cacacacaca 
tggctctcac 
agccagtcaa 
cactctgctt 
cctcattccc 
gcccctcccg 
gggcaacagc 
agcttggaca 
gggaggaggg 
ctttctaacc 
cactttggga 



25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25660 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
. 28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 



-48- 



BNSDOCID: <WO_ 



_02092000A2_I_> 



WO 02/092000 



PCT/US02/14877 



ggccgaggag 

aaaccctgtg 

cacagctact 

gtgagctgag 

tccaggctgg 

tccaactgag 

accagttctg 

acagcccagg 

gacaaaggag 

gacctggcct 

ttcgactcca 

tgagccttca 

gcttccagaa 

actttttcag 

gtccgcactt 

ccatggccaa 

ggccaggcac 

tcacctgagg 

aaaactacaa 

gctgaggcag 

ccactgcact 

aaaaaggaac 

tgaaagctga 

gagtgcagtg 

cctacctcag 

tttgtatact 

gactcaggtg 

cacccccacg 

cccccccacg 

cccccccacg 

cccccccacg 

cacccccacg 

cccccccacg 

gcaaattcga 

agggtttacc 

aagagcacct 

tccgtccagt 

ccggagatga 

gcctcctcgc 

gccgcatcct 

accagccgta 

caaaccgtga 

gcccagagca 

cccctaaggc 

ctgcagcgca 

ctgttacgtc 

atcgtcctcc 

gttggcctga 

agacccagcc 

aagattccct 

catcagcttc 

tacatttaaa 

tgtgccagtg 

tctggcctca 



ggtggatcac 

tctactaaaa 
caggaggctg 
atcgtgccat 
gagtgctgag 
cctgggggtg 
acactgaccc 
taagggtgtg 
ggacaagggg 
agctggccag 
aagctttcct 
agggcccagg 
cggacccgag 
ccaatgcaat 
cccaggccgc 
ccacatcggt 

ggtggttcac 

tcaggagttc 
aaattagctg 
gagaatcact 
ccagcctggg 
ataactatgg 
gctctttttt 
gtgggatcac 
cctcccaaga 
ttgaagagat 
atctgcccgc 
ggttcatttt 
ggttcatttt 
agttcatttt 
ggttcatttt 
ggttcatttt 
ggttcatttt 
gagagagatg 
cgtgagcggg 
tccgggatgt 
acagcttctt 
ccacgttctg 
tcacgtctgt 
ccaggccgct 
cgtcccggcg 
gggccactgg 
ctcctggtac 
gttgtcagga 
gcaaacagcc 
tgttccccca 
agaggctaca 
cacaggggat 
acagggccac 
tacaagtctt 
ccatccaccc 
gggtcactct 
ccatgcagaa 
cctacacttc 



ttgaggttag 
atacaaaacc 
aggcaggaga 
tgcgcaaaga 
agactgtggt 
gtgggaaatg 
agcacagaac 
tgtacggccc 
tggccaagca 
tggagctggg 
gccacccacg 
tgaactcagc 
aaccactccc 
tagaaagcca 
ccaccataca 
tgcctcagac 
gtctgtaatc 
gagaccagcc 
ggcgtggtgg 
tgaacccggg 
caacaaggga 
agtctcaagg 
ttttttgaga 
agctcactgc 
agctgggatt 

ggggtctcac 

cttggcctcc 
cagagtcgca 
aagagtgaca 
cagagtcgca 
aagagtgaca 
cagagtcaca 
cagagtcaca 
gtgacactcc 
gtccaaggcg 
gccattgagg 
gcccacccag 
cacggcggcc 
ccagtacacg 
gaccacgatg 
gttggcaaat 
ctaagccagc 
tgccctgccc 
agttgcctgg 
ttcttcccac 
gatatgggtg 

ggggccctgt 
ggggacatgc 

attttttgag 
tgaccactgt 
ctgtattcca 
ggaatgccac 
caaggtcaga 
tctccaccac 



gagtttgaga 
tagccaggtg 
atcacttgaa 
agatctaggc 
gacactgaat 
gctcttgtgt 
ctgacaggtc 
ccagagtcac 
aactgttccc 
ccacctcccc 
ctctccccac 
cagactagga 
agcaggttct 
ggagaaggtg 
cacagcaaag 
atgcaagttt 
ccagcacttt 
tagacaccat 
tgggcgcctg 
aggcgaaggt 
gactccatct 
ggaagtaatt 
caggatctcc 
agcctcgatc 
acaggtgcat 
catgttgccc 
cagagtgctg 
ccgagtgctg 
ccgagtgctg 
ccgagtgctg 
ccgagtgctg 
ccgagtgctg 
ccctttttct 
ccgccccctg 
atggccctcg 
ttggccacct 
tcgcaggcga 
cccgtctggt 
gctcccttgg 
gtggactcca 
agcaggagcg 
aagatacaca 
atgcccaaga 
gcagccccgg 
attcggtgac 
acccttcctg 
cctgttctgc 
aggcacagca 
gggttcagag 
ccaagctcag 
tgtttctccc 
cccttggctc 
ccccctagcc 
cccaccccaa 



ccagcctggc 

tggtagtgca 

ctcgggaggt 

cggcccctca 

gaactaacag 

tctagtcaag 

agcaagggcc 

tcccaggctg 

tctgctcggg 

ttaaactctc 

ctgggatcac 

gctgaggagg 

gcttccagac 

caaattccac 

atgtgtttaa 

taaaaaggaa 

gggaggccga 

ggtgaaaccc 

taatcccagc 

tgcagtgagc 

caattaaaaa 

ccttcaacaa 

tcactttgtc 

tcccaggctc 

accatcacac 

agtgtggtct 

ggattacagg 

gggttacagg 

gggttacagg 

gggttacagg 

gggttacagg 

gggttacagg 

gaaaaacaac 

gacccaggtg 

gctggtcaag 

cgatgcggtt 

ggccgtcggg 

tcaggtaggt 

aaaactggaa 

gcttgactcc 

gcgaggctgt 

gccctgggat 

cctccagttc 

cccgcatcat 

agcacctgtt 

ccatgcccaa 

agagaagcca 

ctggccatgc 

cccaggccag 

gcccgtttcc 

accctgcttc 

agacaccttc 

tggcctccaa 

gcattcctga 



caacatggcg 
cgcctgtaat 
ggaggttgca 
accggtgagg 
gcaaagggct 
acctctgcca 
agggcttagc 
caagaaaagg 
agtctggggt 
caccccggac 
ggccaggccc 
acacagggca 
aaggagctgc 
ctgcctgagc 
ccattcaaac 
cataactatg 
ggtgggtgga 
catctgtacc 
tacttgggaa 
cgagattgtg 
aaaaaaaaaa 
taacaaatct 
gcccaggctg 
aaatgatcct 
ccgattcatt 
tgaattcctg 
cctgagccaa 
cctgagccaa 
cctgaaccaa 
cctgagccaa 
cctgagccaa 
cctgagccaa 

ttgggctcat 

gagtcgcagc 
gtcctgccag 
ggtctctgag 
agagaccagg 
ctgcttgatg 
gtccactgcg 
gccggcgtcc 
ggggcagaag 
ggagcactat 
cttcctccca 
tcagaggctc 
tgtttaccaa 
aacctcccac 
catccccttt 
tgctcgctac 
acagagcctc 
ttggccgtgg 
tggacattcc 
cacagctccc 
ggccttggcc 
tctgcctgcg 



28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 

29220 

29280 

29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 

31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 
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gccaggctgg 
agaagcctca 
agaggccttc 
aacccccaac 
gacaggagga 
ggttaggctg 
gctgacattc 
agccaacccc 
ccaatgacaa 
gaaagatttg 
gagaccccag 
ctggctcaaa 
cacagaggac 
catcatctta 
ggaaggcata 
catcaaaaga 
caaatcatag 
acaatgagaa 
ccaaagatga 
catcagggaa 
tactatcaaa 
tgcaccactg 
ccaaaaactt 
caaaagaact 
ccttcccaac 
gcactttggg 
ccaacatggt 
cacctgtaat 
ggacgttgca 
tctgtctcaa 
gcgtccctct 
actattcagt 
cccagcactt 
ctggccaaga 



ctccctcacc 
tctcacagac 
agaccatcac 
acagactcag 
ggttaagggc 
ggctggccac 
ctcatgaaaa 
acactcccac 
tcacagaact 
cagcctctat 
cgcagtgggg 
tcctggcccc 
ctgaggccag 
ggtctacagt 
ggccacaaat 
cactatcgag 
atctactaag 
aacaactaag 
catataaatg 
atgcaaatca 
aaaacaaaat 
gcaaaaatgt 
aaacacagaa 
gaaaacaggg 
agctaaaaca 
aggctgaggc 
gaaaccccgt 
cccagctact 
gtgagccaag 
aaaacagcaa 
actgagggat 
ctcaaaaagg 
ggggaggctg 
ctggcaccna 



tccctgtgca 
aacggtctca 
cccaccagac 
agggactaag 
ctccagggag 
aaacaccccc 
ctgcagaccc 
cttcaccaag 
gaatcctgta 
gagggtagct 
tagatggttc 
tcccattccc 
gaaaatgaca 
tctttgtggc 
aaagacaaac 
agagtaaaaa 
agattaatat 
ccaactcaaa 
gccaataaac 
aaactacaat 
agcaagtgtt 
gaaatggtgc 
ttaccatatg 
acacaatcag 
tggaggcagc 
gggtggatca 
ctctactaaa 
cacaagtctg 
attgcgccac 
aacaaaaaca 
gaatagcggg 
aaaaagattc 
aggcgggtgg 
gctacacana 



ccgcagccct 
ttcccacaac 
acctcagacg 
aagccacatg 
ctccatcagt 
aaaaaacatc 
ctgggtcctc 
aaagagaaag 
gttagttcag 
gggaacacaa 
cgagcacagg 
aggagacatg 
gcggcgcctc 
aacgagggac 
gggacttcat 
ggcaacccac 
ccatgaaata 
aatgggcaaa 
acatcaaaac 
aagataccat 
ggtgaggatc 
agctactatg 
acccagcaat 
atatgcatac 
caggcatggt 
cctgaggtca 
atacaaaaat 
aggcaggaga 
tgcattccag 
aaaaaacaaa 
gcaaaatctg 
tggtcaggca 
atcacctgaa 
aagtatangg 



cagccccttc 
gggctcaatg 
tcggaccagg 
aggagtgaac 
ccgtgttctg 
tgaagccttg 
ctgcgcagat 
ccaaaacaaa 
ttggtttcat 
agggccagag 
cctccctgcc 
ctccacagga 
cgccgcccca 
actgtgaaag 
gggaagctaa 
agaatgagag 
cagagaactc 
caacttgaac 
aggcttaata 
cttgcaccaa 
tggagcaact 
gaaaacagca 
ttcgctttgg 
accttggatc 
ggctcacgcc 
ggagttcgag 
tagctgggcg 
atcacttgaa 
cctgggtgac 
caaacatgga 
ctccatccac 
cggtggctca 
gtcaggaatt 
ccccggaaa 



tgcctgtgca 
agaaatcagg 
agggtccagc 
acaagatgtg 
ctgtcagcag 
gcttgaaaca 

gggggagccc 

ctcaactcag 
ttcagcaggg 
catggcccag 
aagacaagca 
tgggaggaca 
cccgtgctgt 
tcaaacaaca 
agattttgtg 
aaaatatttc 
ctaaaactca 
agacatttct 
tatccctaat 
ttaggacggc 
ggaacccttg 
tggcagttcc 
gttatatacc 
acagcagcat 
tgtaatccca 
accagcctgg 
tagtgacggg 
ccctggaagt 
acagcgagac 
agcaacccaa 
acaatggagt 
tgcctgtaat 
caaggcccgc 



31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33769 



<210> 9 

<211> 72049 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 

<222> (8356) , (8385) , (38585) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 9 

tataccttgc 

cccataattt 

ttacggatca 

ctttgggagg 

agcctggcca 

tggtgggtgc 

aggagacaga 

cgagactctg 



gcggaccttc 
tgccacacag 
aaatcgactt 
ctggagttgg 
acatggcgaa 
ctctaatccc 
ggttgcagtg 
tctcaaaaaa 



ggctcctgtg 
acttagttgt 
ttagggtcag 
ggttgggggg 
actccatctc 
agctactccg 
agccaggatc 
aaaaataaaa 



gtgaagacaa 
gtccatgtat 
gcgcggtggc 
tggatcactg 
tactaaaaat 
gaggctgagg 
acgccactgc 
ataaaataaa 



tatgaagaaa 
cttgtgcacc 
tcacacctgt 
aagatcagga 
aaaagattag 
caggagaatc 
actccagcct 
taaatacata 



atagaaatta 
ttttttctgt 
aatcccaaca 
gtttgagacc 
ccaggcgtgg 
gcttgaaccc 
ggcaacagag 
aattgacttt 



60 

120 

180 

240 

300 

360 

420 

480 
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taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 54 0 

catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgttttcct gcctgtgtca 600 

tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660 

ttgaggacat ttgggttgtc tccaggttct tggtcttgag ttttgctgtt cttgtggagc 720 

catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780 

cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 84 0 

ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900 

cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttaccaatt 960 

tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020 

ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080 

ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 114 0 

cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200 

attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260 

ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggtc aggagttcca 13 2 0 

gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 13 80 

gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 1440 

gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 15 0 0 

aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560 

ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 162 0 

ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 16 8 0 

aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740 

ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1800 

cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 18 60 

ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920 

tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980 

ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 2 04 0 

tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100 

gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160 

cgacaggttt gggccaccct ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 222 0 

cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280 

gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340 

gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400 

ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 2460 

cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520 

cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 25 80 

gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 2640 

cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700 

ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtccatgt ggtcggggac 2760 

agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2 820 

gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2 880 

ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940 

tgcatgcgtg cagggaggaa ggggagaggt cagcagcccg cctggaggag gctcgggcga 3000 

ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060 

atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 312 0 

ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 318 0 

agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240 

ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 33 00 

cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360 

ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420 

accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 3480 

tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 3 540 

aaacacctac agattcaccc cagcagtgaa aggactgctt cgccacagag gtttgattta 3 600 

ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660 

ctgctgacct agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720 
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cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780 

atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 3840 

accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900 

ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960 

ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 402 0 

ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4080 

tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140 

cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200 

agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260 

actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320 

agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4380 

tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4440 

tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4500 

ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4560 

cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 462 0 

aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 468 0 

gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 474 0 

ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800 

ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4 860 

ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4 92 0 

ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4980 

ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040 

gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5100 

ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5160 

ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 522 0 

ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 52 80 

agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340 

caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400 

tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460 

tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 552 0 

atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580 

aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 564 0 

ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700 

ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760 

ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 582 0 

ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880 

aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940 

ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000 

atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060 

gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120 

catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180 

ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240 

tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 63 00 

acagcggatc cctgggccct gttgctcaga tgctctcaga gtgttgctgt ggccacggag 63 60 

ggagcctgag ttaagcttct cttgtgccgg ttgtacgctg tcaggtcaca ctggtgagtt 642 0 

aggcagggca cagatgccca gagcagaggg aactttcctt ggggattcaa cacgtgcaag 64 80 

tcttaggggc tggcaaatcc tgccctcagc tagagagggg gcttttattt gagaccagaa 6540 

tcacctgagc atcctcctgt ccccagctgt gtccagcctg tctgcaggga catcctgaga 6600 

ggaccaggct ctcccctcat ccacctgcct aagtgccact ctgaaccctg tccacctgtg 6660 
ccgtggaggg gcgtgacctc aagctgctca gccagcagca ggcttggccc tggggggcag • 672 0 

cagagaccca ggtggctgtg gggtgggtgc ttcgtggcgt ggttctgaaa cttcgttgga 6780 

agtgtgtgga cagtgccttg cctgttctct gtgggaccct atttagaaac gaggtctgag 6840 

ttactggggg tcatcactgt gttctgatgg cccagctgtg tggaggccgc ggtgcagccc 6900 

catccaagga gccagggccc tgggtctagc cgtgaccaga atgcatgccc cggaggtgtt 6960 
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tctcatctcg cacctgtgtt gcctggtgtg tcaagtggtc gtgaaactct gtgttagctc 702 0 

ttggtgttcc tgaaagtgcc cccgggtctc aggcctcaga accagggttt cccttcatct 7080 

cggtggcctg ggagcatctg ggcagttgag caaagagggc gattcacttg aaggatgtgt 714 0 

ctggccctgc ctaggagccc cccggcacgg tgctggggcc tgaagctgcc ctcgggtggt 72 00 

ggagaggagg gagcgatgaa gtggcgtcga gctgggcagg aagggtgagc ccctgcaagg 72 60 

tgggcatgct ggggacgctg agcagcatgg ccagcagctg ggtctgcagc ctggtacccg 73 2 0 

gcgggacttg tggttggggc tggtttgtgg ccaggagagg ggctggcagg agacaagggg 73 80 

gactgtgagg cagctcccac ccagcagctg aagcccaatg gcctggctgt gtggctctca 7440 

gctgcgtgca taacctctca gtgcttcagt tctctcattt gtaaaatgag gaaacaaaca 7500 

gtgccagcct cccagaggtg tcatgaggat gaacgagtga ccatgtagca tgggctgggt 75 60 

gcgtgtcacc taacatcacc agcctttgca aggagagccc tgggggcctg gctgagtatt 762 0 

tcccttgccc ggcccacccc aggcctagac ttgtgcctgc tgcaggccct tgacccctga 7680 

ccccattgca cctgtctcca caggagccga ggaggtgctg ctgctggccc ggcggacgga 774 0 

cctacggagg atctcgctgg acacgccgga cttcaccgac atcgtgctgc aggtggacga 78 00 

catccggcac gccattgcca tcgactacga cccgctagag ggctatgtct actggacaga 7 860 

tgacgaggtg cgggccatcc gcagggcgta cctggacggg tctggggcgc agacgctggt 792 0 

caacaccgag atcaacgacc ccgatggcat cgcggtcgac tgggtggccc gaaacctcta 7980 

ctggaccgac acgggcacgg accgcatcga ggtgacgcgc ctcaacggca cctcccgcaa 804 0 

gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8100 

gtaagacggg cgggggctgg ggcctggagc cagggccagg ccaagcacag gcgagaggga 8160 

gattgacctg gacctgtcat tctgggacac tgtcttgcat cagaacccgg aggagggctt 822 0 

gttaaaacac cggcagctgg gccccacccc cagagcggtg attcaggagc tccagggcgg 82 80 

ggctgaagac ttgggtttct aacaagcacc ccagtggtcc ggtgctgctg ctgggtccat 8340 

gcgtagaaag ccctgnaaac tggagggagc cctttgtccc cctgncttca gtttcctcat 84 0 0 

ctgtagaatg gaacggtcca tctgggtgat ttccaggatg acagtagtga cagtaagggc 84 60 

agcctctgtg acactgacca cagtacaggc caggcctctt tttttctttt tttttttgag 8520 

atggagtctc actctgtcgc ccaggctgga gtgcagtggt gtgatctcag ctcactacaa 8580 

cctctgcctc ctgggctcaa gtgattctcc tgcctcagcc tcctgagtag ctgggattac 8640 

aggtgcctgc cactgtgctt ggctaatgtt tgtafcttttg gtagagatgg ggtttcaccg 8700 

tcttggccag gctggtcgca aactcctgac ctcaggtgat ccacctgcct cagcctccca 8760 

aagtgctggg attacaggca tgagccacca cgcccggtca ggccaggcct cttttgaaca 8 82 0 

ctttgcacac catgggtctt ttcatccagg ggggtaggta cagttgtaca gttgaggaca 8880 

ctgaagccca gagaggctca gggacttgcc cagggtcaca cagcaggatg tggcaggtgt 8940 

ggggctgggc ctggcagcgt ggctccagct ttccagcata gaaatctgtg aaagcagata 9000 

gtttgtcggt cggtagggga gactttctga gacccgcccc agcggctcag agggtagtag 9060 

ccaggggcct tcctgggggc tcataaccca gaacactgaa tgggaaaacc ctgatggagg 912 0 

aggcgcagtg gagctgtggg tgccgatggg aagtcccaga ggagctggga ggtcagtagc 9180 

ggtgctgccc tctgtggagc acttagtggg caccaggtgt gtttccaggt tcatggccct 9240 

gggacctgaa gctcagaagg tgaagtaact tgcccagggc acccgtcggg cagcggcggg 93 00 

cagaggattt gtgggctgtg gagcctgtgc tcgtggccca gccctggggg ttgtgagtgt 9360 

gctggccggg gagcttttcc tgcaagtgga ctggtgtcta ggagccagca tgtcaggcag 9420 

caggcagcgg gagtgcagca ggcagcggga gcacagcagg cagagggcgg ggctcgagca 94 80 

gccatccgtg gaccctgggg cacggaggca tgtgggagag ggctgctcca tggcagtggc 9540 

tgaagggctg ggttgtgccc cgaggagggt ggatgagggt aagaagtggg gtccccaggg 9600 

gctttagcaa gaggaggccc aggaactggt tgccagctac agtgaaggga acacggccct 9660 

gaggtcagga gcttggtcaa gtcactgtct acatgggcct cggtgtcctc atctgtgaaa 972 0 

aaggaaggga tggggaagct gactccaagg cccctcctag ccctggtttc atgagtctga 97 80 

ggatcccagg gacatgggct tggcagtctg acctgtgagg tcgtggggtc cagggagggg 984 0 

caccgagctg gaagcgggag gcagaggggc tggccggctg ggtcagacac agctgaagca 9900 

gaggctgtga cttggggcct cagaaccttc acccctgagc tgccacccca ggatctgggt 9960 

tccctccttg gggggcccca gggaacaagt cacctgtcct ttgcataggg gagcccttca 1002 0 

gctatgtgca gaaggttctg ctctgcccct tcctccctct aggtgctcag ctcctccagc 10080 

ccactagtca gatgtgaggc tgccccagac cctgggcagg gtcatttctg tccactgacc 1014 0 

tttgggatgg gagatgagct cttggcccct gagagtccaa gggctggtgt ggtgaaaccc 10200 
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gcacagggtg 
gcttgccgct 
acatttacaa 
actgtttacc 
cactgacatt 
acaccaacat 
accgacaccg 
taccaacact 
caccgacatt 
ctgacactga 
tctactgaca 
gacatttact 
caccagcatc 
ccaacgccga 
tttaccgaca 
gatatttacc 
caccgacatt 
cggcttgggg 

tggggtcatg 

ccctaaaatc 
cctcgggtgg 
cgccaagaca 
gcccagccac 
tctgctattt 
gccttgagag 
aggtgcctgt 
tttgaggcca 
tgtgagattt 
tgccctgctt 
gtttggtgat 
tccctttgct 
gtggacgcag 
agcctcgccc 
cctgccgctg 
tgtggtcggg 
ggtttcagtg 
tctcttgagc 
tgttgtcgtt 
tctgcctcct 
agtgcccgcc 
gttggtcagg 
agtgttggga 
ttctcttggt 
tgacattggt 
attttcctga 
cagggctccc 
cgagtgtgtg 
accttcaggg 
ccctaggagc 
ggcaaggccc 

gtgggagcag 
ccatgtactg 
gagtgcagtg 
tcctgcctca 



gaagtgggca 
ggcatgctca 
acaccgacat 
aacactgaca 
taccaacact 
ttaccgacac 
tttaccaaca 
gacatctact 
taccaacact 
catctactga 
ctgacattta 
gacactgata 
taccaacacc 
cgtttaccga 
ctgacattta 
aacgccagca 
tactgacacc 
gcaggccttg 
gacttctgct 
gagtgtgcca 
cccaacggcc 
gacaagatcg 
cccctgcagc 
ggccacatgg 
ggtagcgcta 
agacacagct 
gcccgggcaa 
aaattatttc 
aagacagaaa 
tggttcctaa 

gggtgcagcc 

cctcgccctg 
tgccgctgtg 
tggtcgggtt 
tttcagtggc 
gcctcgtccc 
ttttttgttt 
gttgttgccc 
tgggttcatg 
accacgcctg 
ctggtctgga 
ttacaggcgt 
gccagtgata 
gcctctcctc 
gctgcatcag 
aggggaggcc 
tctgtgttgc 
tggggaaccc 
tgcaggtgtg 
atgtttgcgg 
gtcagggact 
tctcagcagt 
gcgcaatcac 
ccctctgagt 



tccctgtccc 
gtcctccagc 
tgaccgacac 
tctactgaca 
atttaccaac 
caacatttac 
ccgacgttta 
gacgctggca 
gacatttacc 
cactggcatc 
ccaacaccag 
tctactgaca 
gacatttacc 
cgccagcatc 
ctgacactga 
tctactgaca 
aatatttact 
ctcttggcac 
tcttctccag 
acttggatgg 
tggccctgga 
aggtgaggct 
cagatgtacg 
aatgcttgag 
tacaacgtcc 
acttggagac 

aggggacaca 

acaatacaaa 
acatgggtgt 
gcagttgaag 
tcgccctgct 
ccgctgtggt 
gtcgggtttc 
tcagtggcct 
ctcgtcctgt 
atgggcgtgc 
gttgtttgtt 
aggctggagt 
ccattctcct 
gctaaatttt 
actcctggtc 
gagccaccgc 
tgtacatgtg 

ggggtggatg 

tccttgttga 
cgaatcaggc 
aggatttcag 
agggatgggg 
gccatttcct 
gccgctgagg 
cgcgtgtctg 
ggttttcttg 
ggctcgctgt 
agctgggact 



a gggg a gccc 

acttactgac 
cgacatttac 
ctggcatcta 
actgacatct 
caacactgaa 
ccgacaccga 
tctactgaca 
aacactgaca 
tactgacact 
catctaccaa 
ctggcatcta 
aacaccagca 
taccaacact 
catctactga 
ctgatgttta 
gacatcaaca 
tggggatgct 
cctcatgtac 
gcaggagcgg 
cctgcaggag 
cctgtggaca 
tattggcgag 
aaaatagtta 

tgtggttacg 

tgaggtggga 
ggaatcctct 
attaagacaa 
tgttgaagcc 
gcagttttgt 
gctgtggtcg 
cgggtttcag 
agtggcctcg 
cgtcccgtgg 
ggacgcagcc 
tttggcagct 
tttgtttgat 
gcagtggcgc 
gcctcagcct 
gtatttttag 
tcacatgatc 
gcccagccct 
tccccatcgc 
cctccctctg 
cgtcaccatc 
agcttgcagg 
accctgcttc 
gaggctgagt 
gctggagctc 
atgatttgcc 
tggcacacca 
agacagggtc 
agcctcaatc 
acagacacat 



ccagggactc 
accagcatct 
cgacactgac 
ccaacactga 
actgacattg 
atttaccgac 
catttaccga 
ccgatgccag 
tttaccgaca 
gacgtttacc 
caccgacatt 
ctgacaccaa 
tttaccaaca 
gacatttacc 
tactggcatc 
ccaacaccga 
tttagccatg 
gcagagacca 
tggacagact 
cgtgtgctgg 
gggaagctct 
tgtttgatcc 
gcaccgatgg 
caatactttc 
taagatgtta 
ggatcgctgg 
gcactgcttt 
aaagttaatc 

agaggcagct 

ttttccatag 
ggtttcagtg 
tggcctcgtc 
tcccgtggac 
acgcagcctc 
tcgccctgcc 
ttttgctcac 
tttgtttgat 
gatctcagct 
cccacatagc 
tagacagggg 
cacctgcctc 
ctgttgagca 
accatcgtca 
tttccagcaa 
ggggtcacct 
gcagggcagg 
tgagcgggag 
ggacgccctt 
cttgtaaatg 
ttcacgcatc 
ggcctgtgac 
tcgctcgctc 
tccctgggct 
accaccacac 



tggtcactgg 


10260 


actgacacca 


10320 


atttaccaac 


10380 


catttaccga 


10440 


gcatctacca 


10500 


accgacattt 


10560 


cactgatatt 


10620 


catctaccaa 


10680 


ttgacattta 


10740 


gacactagca 


10800 


taccaacact 


10860 


catttaccaa 


10920 


ccgatgttta 


10980 


gacaccgaca 


11040 


taccgacact 


11100 


catttacgag 


11160 


tgatgggggc 


11220 


gacagactca 


11280 


ggggagagaa 


11340 


tcaatgcctc 


11400 


actggggaga 


11460 


aggaggccag 


11520 


gtgcctgtgc 


11580 


tgacaaaaac 


11640 


tcattcggcc 


11700 


agtccaagag 


11760 


tgccacttac 


11820 


acatatccac 


11880 


gctggcctga 


11940 


atgtctgttc 


12000 


gcctcgtccc 


12060 


ccgtggacgc 


12120 


gcagcctcgc 


12180 


gccctgccgc 


12240 


gctgtggtcg 


12300 


ctgtggagcc 


12360 


tgtttgtttt 


12420 


cactgaaacc 


12480 


tgggattaca 


12540 


gtttcaccat 


12600 


ggcctcccaa 


12660 


tattttgagg 


12720 


cccattgagg 


12780 


cttctgaagg 


12840 


ttgctctcct 


12900 


atggagaaca 


12960 


gagtttcagc 


13020 


cccacgaaaa 


13080 


ttttgttttt 


13140 


cccgctaccc 


13200 


aggcgttgtt 


13260 


acccaggcga 


13320 


caggtgatcc 


13380 


ccagctagtt 


13440 



-54- 



BNSDOCID: <WO 02092000A2J. 



WO 02/092000 PCT/US02/14877 

tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 13500 

ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 13 5 60 

cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 13 62 0 

tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 13 680 

cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc 13 74 0 

ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 13 800 

gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 13 860 

gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 13 92 0 

aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 13 980 

catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 14 04 0 

ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 1410 0 

cccagcctct gtcaaccact aatctacttt ctgtctctgg gggttcactt cttctggacg 14160 

ttttgtgtga ctggaatcct gcaatatgtg gtccctgcgt gtggcttctt tccatagcat 1422 0 

tgtgttttcc agattcaccc acacattgtc gcacgttatc agaatctcat tcctgactgg 142 8 0 

gtgcagtggg ttaggcctgt aatcctaaca ttctgggagg ccaaggcggg acgatcactt 14 3 40 

gaggcaggag tttgagacca gcctggccag cctagcaaga ccccagctac caaaaaattt 144 00 

taaaagttaa ctgaacgtgg tggtggtggg cacttgtggt tcccagctac ctgggaggct 14460 

gaggttggag gatcgcttaa gcccaggagg tcaaggctgc agtgagctat gatcgcacca 14520 

ctgcactcca gcctggacaa cagagcaaga ccctgtctga aaaaaaaaac aaaaaaaaaa 145 80 

gttcctttct ttttgtggct ggatgacatc ccattgtatg gccacagcac attttgtttg 14640 

tctgtttatc gggtggtggg cagtggtttc caccttttgt ctcctgtgaa taatgctgct 14700 

gtgaacattt gaattcaagt ttttgtttga acacctgttg tgaattattt ggatatatgt 14760 

gtaggggtag gattgctgag tcctatggta atgttaggtt tgacttactg aggaaccatt 14 82 0 

aaactgtttt caacagtggc tgcgccgttc tgcatcccca ccggcagtgt gtgagggttc 14 8 80 

tgactttacc tcctcacaaa cgcttctttt ccatttaaaa aaatattcag ccaggtgctc 14940 

tggctcacgc ctgtaatccc agcactttgg gaggccgtgg cgggcggatc acctgaggtc 15 00 0 

aggagttcga gacgagcctg gccaacatgg tgtaacccca tctctaccaa aaatataaaa 15060 

attagccggg tgtggcagcg ggcgcctgta atcccagcta cttgggaggc tgaggcagga 1512 0 

gaatcacttg aacccgggag gcagaggttg cagtgagcca agatcgcgcc actacactcc 15180 

agcctgggtg acaagagtga aactccatct aaaataaaac aaaaataaaa ataaataaaa 15240 

atttattaaa acattcatca cagccagcct agtgggtgtc ccatgtggct ttgcctcgca 153 00 

tttccctgat aactaggatg ctgagcgtct tgtcccaggc ttgccacacc tcagcacttt 15360 

gagatacgtc gcacagtccc catttgcgaa cgagaaatga ggtttaggga acagcagctg 1542 0 

tgtcatgtca cacagcgagc agggggtctc tgagccgtct gaccccacag ccgaccaagc 154 80 

tccaatcctt accgcctcct agtgttgtgg atgtagccca gggtgctccc acatttttca 15540 

gatgagaaca ccgaagctca aaacaggagc gttttgtcca cattggatac acgatgtctg 15 600 

tggtttggtc ctgaagtcac tttatatctc agtggtccag actggagtag gacagggggt 15660 

tctggggaat ggggaaggtg tctcaggtga aaggaaggaa ttccagattc tccatactgt 1572 0 

ccttgggaag ttagaagact cagagggtct ggcaaagtca gacaaagcaa gagaaatgca 15780 

gtcaggagga agcggagctg tccaggaaca ggggggtcgc aggagctcac ccccaggaac 15840 

tacacttgct ggggccttcg tgtcacaatg acgtgagcac tgcgtgttga ttacccactt 15900 

tttttttttt tttgaggtgg agtctcgctc tcttgcccag tctggagtgc agtggcacga 15960 

tctcggctca ctgcaagctc tgcctcccgg gttcatgcca ttctcctgcc tcagcctccc 1602 0 

gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080 

agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140 

gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 162 00 

ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata 162 60 

gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 1632 0 

tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 163 8 0 

ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440 

accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500 

tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560 

tctctactaa atatacaaaa attagccagg cgtgatggtg catacctgta atcccagctg 16620 

cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg 16680 
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agatcccgcc 
aaaaaaaaga 
gggggatcgt 
tggagctcat 
ggggcacggg 
cttctcagcg 
agcgttttca 
cactctctgg 
agagctgact 
gcccgtggag 
aagtggccac 
tccttgccca 
tgttcctgct 
aacacccgct 
ctgctgctgg 
acggctcctt 
tggggtcagg 
aggcaacaag 
ccgctcacct 
cagctgggct 
ctctgccttt 
acgggcagct 
ggtttcatgt 
gcggatgggt 
agcttccagg 
gaccagctgt 
cctccccacc 
gcatccgagt 
attgcatccc 
tcctgggcta 
ttcctgagtc 
ctcagcttct 
ctccccgccc 
ctgcccttct 
ctgtgccaca 
tgaggcccac 
acagcccagc 
ctggaagccg 
tccagggaag 
gtggcccttg 
tcatcatcac 
cagagaccct 
caggacgtac 
ccctgacttt 
gtctgttggc 
cactgcagag 
gtgtgtgttt 
aactgttgct 
acaacagaac 
gtttgcacag 
gactgtccag 
tgggctgctt 
ccttggtgtg 
agctggcagc 



actgcactcc 
taaaaagacc 
ggagcaggcc 
cactgggggc 
tgtctttggt 
ccctttatat 
gactacctca 
gaaggtgcag 
tagcctcgag 
atgctggctc 
cgctcttggg 
gcgccaggag 
gtggaccgcc 
gagtatgcag 
ggttgtggtg 
gaggatgcag 
gaggagacac 
gtggccaacc 
ttgctgggga 
gtgtggagct 
gaacacaatc 
gtggctgatc 
agatggcatt 
ctgggaggcc 
ccgccccagc 
tgccatcatc 
tgctggggcc 
cctgccccac 
cacttccctt 
ggaattccag 
tagagccccg 
cctgtgagcc 
acctgggctc 
cccctcctac 
agacatggct 
gctccagtgc 
cctctcccga 
gggtcactcc 
gtcctctttc 
tatgcccctc 
catcatcatt 
ctgaggtcca 
caggtctggg 
tgctaagtgg 
tgcactgtgc 
ctgtgtgtgt 
gcgtgcgcgc 
gtttcctttt 
tctccccttt 
ccgagtcaga 
agtcagcctg 
tgggatctcc 
ctggtcaggg 
tcggagagga 



agcctgggcc 
aaccgaggaa 
accctgtggt 
ctaaggagaa 
ggccagaact 
gcttacacag 
tcatgggtcc 
gtggatgccc 
attgctgctg 
tgtgttttct 
tatgtcctca 
caaaccctcc 
atctcacatg 
tgtgtgccag 
gcttctgggg 
ttggactcca 
atttggaaca 
caggtgctga 
gtggtgggag 
gggcgctagg 
aggctccagg 
ccttgatgaa 
agagacgccc 
cctcctgctg 
agcctgactg 
agtgacagct 
cacccagggc 
cacgtgtgaa 
ctgcttcaca 
tgttgtgctg 
ggggttggaa 
tccagcatgt 
agccagggga 
cctgcccttc 
gtgtgtgaaa 
cactcttccc 
accccagccc 
agccgtatgc 
caaatggcga 
cagggtctgt 
gtcatttcgc 
gcctgagttg 
tcaggagtga 
gtggaggtga 
tcccagggat 
ttgtgtgtgt 
acatgtgtat 
gggttttgtg 
aaacacgtgc 
ggaaacacag 
caggacggca 
cgtcattccc 
gggcgtgcgc 
cagggctgga 



acagagtgag 
ttgaagtggg 
catgcactgg 
tagatactga 
tggggactgc 
gctgtttcta 
ctttctttac 
agacccgccc 
gcacctcctg 
gctggagttt 
ggcttctgcg 
tggcactttg 
agggtcttgg 
ctgtgtccca 
actctgggga 
ggtggaaggg 
atgggaacat 
ggagcccaca 
agaggattcc 
aggaaggtgc 
ttttcagcat 
ttgggagaag 
acaacagatt 
gccttgactg 
acacacacag 
gggagtggcg 
agtcctgaca 
gcccgagtgg 
tagctgcctc 
gctttgcccc 
gtcctggccc 
cccctcagga 
aggcctggct 
tctcctctgc 
gtggcagggt 
acccgctggc 
catgtgccca 
catggtgggg 
cacctggtcc 
gcgctcggtt 
ttgtctgtga 
gggtctccgt 
cccccaaacc 
catcacttac 
ctggggagag 
gtgtgtgtgt 
aagatctttt 
ttcaacagag 
tgtcagaggg 
ggttcttcat 
gcagccctgc 
tcccagctgg 
ctgctctgct 
cccttgggtg 



actccatctc aaaaaaaaaa 



ggggcgtcac 
aagctcatta 
aggatgagga 
tggggtgcct 
agagggggat 
cctctgtggc 
tgccatccac 
ccccgggaca 
ggtgcgtctt 
agtcatggct 
ttcaggggtg 
gcctgcaggc 
ggcaatggcg 
cagctgaggt 
atggttgggg 
ttttaagatg 
ccagccctgg 
gttccacgtg 
tttctgcggg 
ccagtgcatg 
aacaaaggtc 
tacagagtgg 
tgacagctgt 
gggttttagc 
gtggttccag 
cctacaggtt 
tcgtgggctg 
ttctcaccgt 
aggacacctc 
ctgggacacc 
ccaagccctc 

gggagcgtct 

cccgccatgg 
ctggcatctc 
cgtgccctca 
gctgcccccg 
acatcctgct 
ctgcctggag 
ggcccgagtt 
gccggcctgg 
gctgacccct 
tcgtgccctt 
agcgggagtg 
gctatatccc 
gtgtgtgtgt 
tttattacat 
tggggtactt 
tgggtcttgg 
aaaaacactg 
ccctcagagc 
cagccggcgg 
caccctggga 
gcctctggct 



agtagcagaa 
cctgacgatt 
gtgatggcgc 
cactgcaggc 
acattgcata 
cctggtggcg 
ctgcacgtcc 
cctcggatgt 
ttcctcctgc 
gcttctcagg 
gatgcgccag 
tcgttcagga 
gggacagtgg 
gcaaggagcc 
gaggtataaa 
ctatgtcggg 
acgtgttttg 
gtggtgtgcg 
gctagccggg 
agaggacttc 
tatgaaatga 
agcggagacg 
cctgggaatc 
cccatcctgc 
ccctgggcac 
gcttggagcc 
aggtcccctg 
ttttccagcc 
cttagccctc 
tgcagccaca 
acgttcttgc 
cccctctgcc 
cttttatatc 
tgtgggtctc 
tgctggaggg 
gccctctccc 
tccttggcct 
gctggaagct 
cccatcaccg 
tctcccagag 
gacggggact 
tgacaggcac 
atgggacagg 
tgggctttgg 
gtgtgtgtgt 
gaagcaagat 
cttccctcag 
gctcatgtct 
cacagcaggc 
acagctaggg 
ccggcccatt 
atgggacaga 
ggaccatctc 



16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
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attgtcctca gacacagcct ctcgggtcta gtttcatttc ctgaaaaaca agtgcacaga 19980 

actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 2 0040 

cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 2 0100 

aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaaaaaaaa attatatgga 2 0160 

attcaaattt cagtgtccat aaataatttc ttgagacagg gtctcgctct gtcacccagg 20220 

ctggagtgca gtgctatggc atggctcgct gtacccttga cctcccaggc tcaagcgatc 2 0280 

ctcctgtctc agcctcctga gtagctggga ctacgggtgt gtgccaccaa gcccggctaa 20340 

ttttttttta attttagtaa agacagggtc tttctatgtt gcccaggctt ttctggaact 2 0400 

ccatcttggc ctcccaaagt gctgggatta caggctcgag ccacggagcc cagcctgttt 20460 

ttgttttttc actgataaag ttttgccggg tgtggtagtg tgtgcctcta gcgatttggg 2 052 0 

aggctgaggt gggaggatcg cttaagccca ggagtttgag gctgggctca agtgatcagg 20580 

aggtgaacta tgatcatgtc attgcattcc agcctgggtg acagagcaag aacctatctc 20640 

ttaaaaatat atatttaaaa agtattgggt gtggtggctc acgcctgtgg tcccagctac 20700 

ttaggcatct gaggtgggag gatggcttga gcccaggagt ttgaggttgc agcgagccaa 20760 

gatcgtgtca ctacactcta gcctgggtga cagagcccag accctgcctc tttaaaaaaa 20820 

aaaaccaaaa aacatgtatt ggaacacagc catgcctgtt cagtcacgtg ctctccatgc 2 0880 

tgctttctgc tccagagacc cttatggcct gaaagctgaa aatattttct atcctttaca 20940 

aaaaagtttg ctgacctctg tcctggaaaa ttcatctccc aagttctctt ccggcactgg 21000 

cgttcctggg tgtcctaaat ttggcccctg ttatttctga actctgtttt ggctctgttc 21060 

cctcccagga gccaggacag gcacgttctc tgcatcttgt cccctgacgc ccagaggctt 2112 0 

ggctcggctc aggcattctt ggaaatatct ggctccagga aaggcagagg cctcctgagt 21180 

cggcccagag ggaacctgcc ccaggtctgg gggaggcctg acccagcaga gtggcttttg 2124 0 

ccgatgggtt gggccggtca agatgtgctg aaagttgtcc tcagaaggcc actttgggat 2130 0 

tccttcctcc agtattagag caactgagag ctgctcattg caagcctgat gttttcccag 21360 

ttggccgggt ccaccgggtg ccctgggatt ctgggatctg ggtggaaagt agggggcttg 21420 

ggggagtgtc ctgggttctg gaatccaggt ggcaagtggt gaggttcagg gagtggcttc 21480 

tgagccacca taggggtctc tgtgggaggc tctgcccatc caggagattc cgcaggccct 2154 0 

gccggcccag agccagcgtc ttgcgcttgc cgaggctaca gccagcccca gccgggtgga 21600 

acagcccgtc gcctcctctc actttgtttt ggggccacct gggagtgtgg agcaagggta 21660 

gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 2172 0 

tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 2178 0 

agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 2184 0 

gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900 

gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960 

accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020 

aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080 

gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140 

agtctgggca acaagagtga aactccgtct caaaaaaaat aaataaataa aaaataaaaa 222 0 0 

tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 222 60 

gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 2232 0 

aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 223 8 0 

cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440 

ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 22500 

cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560 

attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 22620 

cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 22680 

ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 22740 

cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800 

gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22 860 

ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22 92 0 

gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 22980 

actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 23040 

ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 2310 0 

tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 23160 
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tcaccctgtc 
agctgcgttc 
gccgccatgc 
ggccaggctg 
gctaggatta 
ccaacaacag 
tgccacctgc 
gcaggaggcc 
agccggagaa 
gctcatggat 
gtgtttttac 
gcattgactc 
catctgagac 
tgacccagcg 
tgctcataac 
gctcataact 
ctctgtgtgg 
gctaattgat 
cctgtcagcc 
gcttctcatc 
cactgctcta 
gctcacgcct 
aagttcgaga 
cgccaggtgt 
tcgcttgaac 
ctgggcaaca 
acaaacaaag 
hacctggcaa 
cagctcaggc 
gacctatatt 
cctggaggac 
gactgactgg 
catcattgac 
cggtgagtcc 
tgacgggtcc 
ggacaggaaa 
tagggagtgg 
ttgagcacct 
ctgcaaatgt 
cacaggtgcc 
ccctgtggcc 
gaggaggtcg 
tggagagagc 
gtgtcctgcc 
gatgaaggat 
tgagatgccc 
taaaatacga 
gaagcctgga 
cttcatcgca 
cctcctttaa 
cccaccactt 
ggctaacaag 
cgggcgcctg 
aagcggagct 



acccaggctg 
cagcgattct 
ccagctcgtt 
atctcgaact 
caggctggga 
atgaccgaag 
aaaacagagg 
tgtggaagct 
cttggatgtt 
catcagaacc 
cttgacagta 
ttcaagaaaa 
ctgcaggctc 
ggaaatgagt 
tgccccaggt 
gccccaggtc 

tgggtgtgat 

cccaggactc 
tgcagtggtt 
ccagcacagg 
tcagctgggg 
ggaatcccag 
ccagcctggt 
ggtggcgggt 
ctgggaggtg 
gagtgagacg 
cgtcatttat 
ggtgcaggtg 
cccaggcagg 
tctgccgtcc 
aagctcccgc 
cagcgccgca 
cagctgcccg 

ggggggtccc 

atgcctgggc 
ggtgacagta 
ttatggacta 
gctgtgtgcc 
agttccttgc 
aagctcagga 
cacgcatgtg 
ttgtctggca 
agcatccagg 
ggggtcccac 
gagaggacag 
gggttgactc 
gggagctggg 
gctgaggcag 
ccctgcagtc 
aaacgcgttc 
tgggaggccg 
gtgaaacccc 
tagtcccagc 
tgcagtgagc 



gagtgcaatg 
cctgcctcag 
tttttttgta 
cctgacctca 
tcacactgtg 
tctttgtttc 
acttgtcgct 
tgcctttaat 
gcacctaact 
acgtcctatc 
ttttctcctc 
atatttagct 
agacttagaa 
tcacagagaa 
ccttacattt 
cttatatttg 
ctggagccct 
cttcctgcgg 
cgggagagga 
tagggggtgc 
acgtggcatc 
cactttggga 
caacatggtg 
acctgtaatc 
gagcttgcag 
ctgtctcaaa 
ccagcacccc 
aagttgctgc 
gtccgggttg 
tgcaggtgat 
acattttcgg 
gcatcgagcg 
acctgatggg 
aagccatggc 
ataagtgttg 
tctggccaag 
ggaatgtcgg 
cggccctggc 
cctactcgca 
ctccctcctg 
caccttccac 
caatgatttg 
tgcggcaggg 
actgcacctg 
aggacgtgat 
tgctgcccgt 
aattgaggga 
tgtgggcgac 
ctcatgtagg 
atgctggggc 
aggcgggtgg 
gtctctacta 
tactcgggag 
cgagattgcg 



gcatgatctc 
cctcccgagc 
cttttagtag 
aatgatccgc 
cctggcccta 
ttaacatgca 
tttctggtaa 
ggccttgtgt 
caaccttcct 
tcacgcggct 
ggtggctttt 
gctacatctc 
ccagaagtgc 
gcgggagaac 
gctccaggtc 
ctccaggtcc 
ccgcccattg 
agcacgccct 
cacctgcttg 
tatgggaaag 
ctagtgaaaa 
ggctgaggag 
aaacccatct 
cgagctactc 
tgagccgaga 
atctcaaaca 
tggggaacca 
tcttgggcat 
gctctcgttg 
caatgttgat 
gttcacgctg 
ggtgcacaag 
gctcaaagct 
tcagccatgc 
agctcaggtg 
gacagatggg 
taacaatggt 
cgggagcctt 
ctggggagca 
ggtctgcctg 
ctgaaagcca 
tctcttcctg 
acaggcctgg 
cttgtcagag 
gcttgctgct 
cgggtggatg 
gcaggttggg 
ccctggagca 
ggatgccatc 
cggggcagtg 
atcatgaggt 
aaaatacaaa 
gctgaggcag 
ccactgcagt 



ggctcactgc 
agctgggatt 
agatagggtt 
ctgcctcggc 
gcagctttgt 
ttccatctgc 
gctggaaatg 
ctctttcatc 
gttaacatac 
gtatgcttcc 
gcggtggttg 
agaggagaca 
cctcagagtt 
tttgccccag 
ctgccccagg 
tgccccaggt 
ctgcacctgg 
ggttctccag 
cctggtctgt 
ggatcctcag 
catcatggcc 

ggtggatcac 

ctactaaaaa 
gggaggctga 
tcttgccact 
aacaaacaaa 
tgctacctgg 
tgaacccgtc 
gtgtggccct 
gggacgaaga 
ctgggggact 
gtcaaggcca 
gtgaatgtgg 
agacttgcat 
ccccgacctg 
aagggaccaa 
tagaaagtga 
cgtgcccaca 
ggacgcagag 
ggctgggctg 
ggatcttcag 
aaaaggtgac 
ggctcgcggg 
gcactcagtc 
gcattgcctg 
tgatgtcaga 
gcagaaagca 
gtgagtgctt 
catgaattta 
cagtggctca 
caggagatcg 
aaattagccg 
gagaatggcg 
ccgcagtccg 



aatcttcgct 
acaggtgtgt 
tcaccatgtt 
ctcccaaagt 
cctgtgccat 
cttacagttt 
taatctggta 
ctgtcctgag 
agttctgcag 
gttggttcag 
cttttaatca 
gggtggaaag 
catccggccc 
gccctgccgt 
ccctgcagtt 
cctgcagttg 
ggcaggcatt 
gcagccgctg 
tccaaatctt 
ttggccctgt 
gggcgcggtg 
ttgaggtcag 
tacaaaaatt 
ggcaggagaa 
gcactccagc 
caaaaaacaa 
tgttttatgg 
ttgtttgggg 
ggcccatcca 

ggcggaccct 

tcatctactg 
gccgggacgt 
ccaaggtcgt 
gaggaggaag 
gggaagggca 
gggagctgat 

ctaacatttg 
gtgaccccgt 
ccgtgcaact 
tgcttgttgc 
gacgctcccc 
agagttacac 
cagggactct 
aatctttgct 
cagtcctggg 
tccccggctt 
cagccccgtg 
ccttcatggc 
gttttcccag 
catctgaaat 
agaccatcct 
ggtgcggtgg 
tgaacccggg 
gcctgggcga 



23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
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cagagcgaga ctccgtctca aaaaaaaaaa aaaaagtaca aaaaaaaaaa aattagtctg 2 6460 

ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 2652 0 

aacccaggag gtagaggttg tagtgagccc gtatcgtacc actgccctcc acctgggcaa 2658 0 

tagagcgaga ctctgtctca aaaagaaaaa aaaaaaaaga acatttatgc caggtgtggt 26640 

ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 26700 

gaaatttgag agtgtcttcc ctgggcaaca tagagagacc tcatctctac cagaaaaaaa 26760 

aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 26820 

gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 26880 

ccagcctggg tgacagacag agaccctgtc tcaaaaaaaa aaaaaaaaaa aagcatttac 26940 

tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 2700 0 

cccagagata agacaaaagg gtgcctccat gggggtgtga tttaaagctg agaaattggg 27060 

cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 27120 

tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg 27180 

agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 2724 0 

agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 2 7300 

gaggaagggc ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 273 60 

cctgccctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 2742 0 

cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 2748 0 

gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 2754 0 

aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 2 7600 

aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 27660 

tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 27720 

cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 2778 0 

cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840 

gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900 

ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 2 7 960 

cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 2 8020 

acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080 

acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 2 814 0 

tccctcgaga ccaataacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 2 8200 

gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 2 8260 

gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 2 832 0 

ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 283 80 

ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 2844 0 

ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 2 850 0 

gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560 

ctctttcttc atttatagtg caaatgtaaa catccaggac aaatacagga agactttttt 2862 0 

tttttttttt tgagacagag tcttactctg ttgcccaggc tggagtaccg tagcgtgagc 2 8680 

tcagctcact gcaacctccg cctcccaggt tcaagcgatt cttctgcctc agcctcctga 2874 0 

gtagctggga ctacagacat gcaccaccac acccagctaa ttttttttat atttttagta 28800 

gagacagggt ttcatcatgt tggccaggct ggtcttgaac tcctgacctc aggggaacag 2 8860 

acggggttgg cctcccaaag ggcggaaata acaggggtga gccaccgttc ccggcctagg 2 8920 

aaaacttttt gccttctaaa gaagagttta gcaaactagt ctgtgggctg gccttctgat 2 8980 

tctgtaaaga aagtttgatt ggtggctggg tgcggtggct cacacctgta atcccagcac 2 9040 

tttgggaggc cgaggtgggc agatcacctg aggtcgggag ttcgagacca gcctcaccaa 29100 

cgtggagaaa ccccgtctct actaaaaata caaaaaaaaa attaaccggg catggcggcg 29160 

cctgcctgta atcgcagcta ctcaggaggc tgaagcagga gaattgcttg aacctgggag 2 9220 

gcggaggttg tggtgagctg agatggcacc attgcactcc agcctgggca acaaaagtga 29280 

aactccgtct cagaaaaaaa aaagtttgat tggtgtaacc aaagcgcatt tgtttatgga 29340 

ttgtctgtgg cagcttttgt tctgccgaga tgagttgtga cagatctgta tgggctctaa 29400 

agcctaaaac atgtgccatc cgccccttta cagaaaaagt gtgctgacct ctgttctaaa 29460 

gtattggaca actacaatgt ttgctcattt attattctat gatttgtttt ctgctttttg 2 9520 

ttgttgttgt tgttgttgag atagggtttc cctctgtcac tcaggctgga gtgcagtggt 29580 

gtaatttcag ctcactgcag cctcgacctc ctgggctcta gtgatcctct catctcagcc 2964 0 
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tccctagtag ctgggactac aggcacacac caccactcct ggctgatttt tttttttttt 29700 

tttttttttt gtggagacag ggtttccgca tgttgcccag gctggtttca aactcctagg 2 9760 

ctcaaacacc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc caccatgccc 29820 

agcctattct actgtttgta ttacatagct ttaaaagatt ttttatgact ttaagtcaca 2 9880 

agggttcttt gtagaaaaaa atatatatat aggaaagtat aaaaagaaag taaaaattgt 2 994 0 

ccataacctc tccagccaga gacgaccgtt gctgacacct cagcatattg cctttaagtc 3 0000 

ttttttctct aagatagcat ttctcttcat cacagtcata tgctacgcag aattctgtat 30060 

cctgattttt tcacttgaca ttacaacagg tatttgatgg cgctgtgaca aactctttgg 3 012 0 

cacaatcttt taaatgtatg aaatactcca ctgcacagat gtttgctttt aggcttaact 3 0180 

gttcttttat tttgcgtgtg ctggttacag ccgggcacag tggctcatgc ctgtaatcac 3 024 0 

aacactttga gagggtgagg caggaggatc acttgagccc agaagtttga gaccggcctg 3 0300 

ggcaacatag tgagacccca tctctacaaa aaactttttt aataagtcgg gcgtagtggt 30360 

gcatagctgt agtcccagcc accaaggagg ctgagttggg aggattgctt gagccccagg 3 042 0 

aggttgatgc tgcagtgacc tgagattact ccactgtact ccaacctgag cgacagagca 3 0480 

agacttgtct ggggaaaaaa aaaaaaaaaa tatatatata tatatatata tatatacata 3 0 540 

tatacataca cgcacacaca cataatataa aaatatatat ttataaatat ataatatata 3 060 0 

atataaaaat atatatttat aaataaaatt tataaattat atttataagt aaatatataa 3 0660 

tatataatat aaaaatatat attatataat atataataaa atatataata taaaaatata 30720 

tatttataaa taatatataa tacatactta taagtatata tttaaaatat atgtaatgta 3 0780 

tattttttaa tgtatgatat ataatataca tttataaata cacatttata ttattttata 3 0840 

taaaatatat ataaaatctc caagttgctt tttccaaaaa ggtgtcttgc tgcatttcaa 3 0900 

acattcattt aaaaacttga atgctggtga tctggtccag aatgtgttca gtagctgctg 3 0960 

ccagtggcca agcatctcgg gagatgtcta caaaacacgc tggttctggc ctggcgtggt 3102 0 

ggctcacgcc tgtaatctca gcactttggg aggctgaggc aggtggatca actgaggtct 31080 

ggattfccgag accagccttg ccagcttggt gaaaccccat ctctactaat aatacaaaaa 3114 0 

aattagccag gcgtggtggc atgtgcctgt aatcccacct acttgggagg ctaaggctgg 312 0 0 

agaatcgctt gaacccaggg ggcagaggtt gcagtgagcc gagatcgcac cattgcactc 312 60 

caggctgggc aagaagagcg aaactccgtc tcaaaaaaaa aaaaaaagat gctggttcct 3132 0 

aaaatgtggc ccttttcctc ctcacctgct gccagaccat cagccgcgcc ttcatgaacg 313 8 0 

ggagctcggt ggagcacgtg gtggagtttg gccttgacta ccccgagggc atggccgttg 31440 

actggatggg caagaacctc tactgggccg acactgggac caacagaatc gaagtggcgc 3150 0 

ggctggacgg gcagttccgg caagtcctcg tgtggaggga cttggacaac ccgaggtcgc 31560 

tggccctgga tcccaccaag gggtaagtgt ttgcctgtcc cgtgcgtcct tgtgttcacc 3162 0 

tcgtatgaga cagtgcgggg gtgccaactg ggcaaggtgg caggctgtcc gtgtggccct 3168 0 

cagtgattag agctgtactg atgtcattag ccttgatggt ggccaggact ggtagggccc 3174 0 

tcagaggtca tggagttcct tcgtggagcg ggtgctgagg ctgtatcagg cacagtgctg 31800 

gctgctttca cctgggccgt ctcaccgaag tgtccatgga gcctgcgtag ggtgggtatc 318 60 

tgtgtcgatt ttacagatgc agaaacaggc tcagagaaac cgagtgactt ccctaaggtc 3192 0 

acatacccag ttagagcaga gctgggccag gaagtgctgt ctcaggctcc tgaccaggtc 3198 0 

tccttgcttt gcactcttgc caaaaccatg atccagaact gactttgagg tccccggacc 32 04 0 

tcaggctcct ccgaaatggc ctcttggagg ctgctgagcc acagctfcagg acccacctcg 3210 0 

agaggcaaat gtgctttgag ctgccaggcg tcctgggggc cctgccttgg gcacggggtt 32160 

cagacaggcc ccagatgtgt ggggcgtctt tctggacttg agttttcttt tctgtgtggt 3222 0 

ggacacagtg ctcacccctt aaagcacctg tgatgtgtgc agcagcccaa tccctgcctg 32280 

tcgcctgttc tgctagggaa ggaaggaata cttcaggatg gcaggacaac agaaagaggt 3 2340 

ccaggtttta gagcaagggc aggtcaaact tagaaaattc tggaatgagg atgtgcattt 3240 0 

cctcttctgg atctgctaaa agaagaggga aggaggggct gctgggggag gagcccagag 324 60 

ccgagtttac atccggatcc cgcaaggcct cccctgccct gaggtcttgt tttgtgatgt 3252 0 

gcttgtgtcc atcctggttt ctgccgtgtc cccaacatcc ggccaagctt aggtggatgt 3258 0 

tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 32640 

caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 3270 0 

ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 32760 

ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 3282 0 

ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 3 2880 
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catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 32940 

caacfcgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 33000 

tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 33 060 

gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 3312 0 

cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 33180 

tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 3324 0 

cccctgggtg cctttgctgc agttatgtcg ggagaggctc tggtgacagc tgtttcctgt 33 300 

gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360 

cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420 

tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 3 3480 

ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33 540 

ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600 

gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660 

accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 33720 

ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 33 78 0 

caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 33 840 

gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 33 900 

agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 33960 

cctgtagtcc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34 02 0 

aggttgtggt gagctgtgat catgccactg cactccagcc tgggcaatag agtgagaccg 34 080 

tctccaaaaa aaaaaaaaga agaagaaaaa gaagctctga ggctccaagt ccccaggcac 34140 

cccttggctt gagggcagac aagggaggag agggtcacct gggcagccct gacttttgtc 34200 

ccctggcaaa gggaccttca gtgaccttgg ccctaggaga gcctctgagc acgtcagcca 34260 

tgtcgaaccg ctcaggaagg gcagcaagaa tttggcttct gacctctgcc tctcctactc 3432 0 

gccatctgca ctgggtgtgg ttgtgcccat tttacagatg aggaggctgg ggcatcgacc 343 80 

agctgaatgc cttgtcccag gtactgcgta ggcagagctg gcagttgaac cccgtgtcct 3444 0 

ggttgtcgct gggggtgggc tgcaccctga cttgtgaggc cagtagcaag gtttgcacgt 34500 

gacttcgtga ccgtcaccca gctctgcagc acatcccgtg acccagctca tccaggccgc 34560 

atgcaaacct gttgccaggc gagaaaccag tcaccgcaca gctgtggttg cctgaaatga 34620 

ttaagctcat taatcacccc ggagtgagga cagactcaga tgaaaaccag caaaagccct 34680 

ggaaactcat gtgaccctgc caatgagggc ggccatgtgc attgcagcct ggccgtcact 3474 0 

cctcggtacg tgttttggac ttaaacgctc cggatgttta ctgagtgctt gattaataac 34800 

atggaaggcc tggtctcatt gctgtgggag tgaaggatgc acagccaggc ctgacatgat 34860 

gagaacaaga acctggagtc tcgctgcctg ggtggtaatc ctggccctgc cacttagcaa 3492 0 

ctgtgtgact gtagccaggt cacttaattt tgctagatcc tgcctgcgct tcagtggatc 34 98 0 

ttgctggttt tccaaggtgg ccaaacactt taaggcattc atgtggtcgc taggctgcag 3504 0 

ggttgaaccc tggctcaccc cgcagggcgc cgtgtgctct gtggcctggc tgtgcctttg 35100 

ctgacaccgt gcccgtgtgt gttcatgcag gtcaggagcg ggtcgtgatt gccgacgatc 3 5160 

tcccgcaccc gttcggtctg acgcagtaca gcgattatat ctactggaca gactggaatc 3522 0 

tgcacagcat tgagcgggcc gacaagacta gcggccggaa ccgcaccctc atccagggcc 35280 

acctggactt cgtgatggac atcctggtgt tccactcctc ccgccaggat ggcctcaatg 35340 

actgtatgca caacaacggg cagtgtgggc agctgtgcct tgccatcccc ggcggccacc 35400 

gctgcggctg cgcctcacac tacaccctgg accccagcag ccgcaactgc agccgtaagt 35460 

gcctcatggt cccccgcacc tcactccctc gttagatcag gctggttctg ggagctgacg 3552 0 

ctgaaaggag cttctcatct ggggttcctg ggtgtacata gatggttggg taggttgtgc 3558 0 

actgcacaag ctgcatgatg ctacctgggg gtccaggtcc aggctggatg gacttgttgc 35640 

ttcatcagga catagataaa tggccaaaac tcctcagctg gaaggtcctg ggcaggatct 35700 

ttgggtgtga aaaccagtca caggggaagg gtgcttgctc atactgccag cacagtgctg 35760 

agtgctttcc atagcgctcg tttactcctc aagcctggag ggtggggagt agcatggtcc 35 82 0 

catttcacgt acaaggaacc cgatgcacag agaggtgtgg caacccatcc aaggccatac 3588 0 

aactggggtg ggttgagccg gggttgactg tggcaggctg gctcaagagt ccctgctcct 3 5940 

gaacccttgc caggcagcct ggcatcagct cggggaattt ttgccctgac ccttggaagc 36000 

aagtgggcct ctttgttctc atgtcagtga tgagaagagt gactttccta tggcccctct 36060 

ggagtacagg tgtttcctgt tggcgggctc ttcccccatg acatcagcag cgagctggtt 3612 0 
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atgattccct acgcagaact tgatagttta taaagctctt tgtcatccag gccccgttgg 36180 

agtctcacgc agacctggtc gcaggcgggg ctggtcttgc ctgtcccagc tgcatggatg 36240 

gggaacttga ggcttgcaaa ggttaagggg ctgttcgagg cccacgctgg caggagatgg 36300 

gcctgggcca gagtctggga cttcccatgc ctgggctgtc tttggtcctg ttgctcacca 363 60 

tccctccctg gggccatgac cttagagagc caaatggagg tgcaggtaac ccacggcaag 3642 0 

gaggggttgc catgactcag agtccccgtc ctgtggccgg cagtacctgg tgcaacgact 36480 

tggatttcag accagccact gtagcccgct gacggtgcgc tcgaagtgcc acagcttctg 3654 0 

aagccaggca ggactcaggc caggagactc tgttagctgt tgagagggag aggccaacgg 3660 0 

atgttctggt tctgctagag agctggttct tcggatcctg gtaccagtgc actgagagga 36660 

ggcccagctt gattctgggg ctgccttgtg gtggcatgtg ctgctcactg acaccctcga 3672 0 

ggagtgtctt ctctcgggct tgttgactgt gcccggtttt ccgcagttca ctggtgcaca 3678 0 

cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 3 6840 

ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 36900 

tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 36960 

gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 37020 

ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 3708 0 

ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 3714 0 

agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 37200 

taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 37260 

gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 3732 0 

ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 3 738 0 

gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 3 744 0 

ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc 37500 

agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 3 7560 

aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 37620 

cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 37680 

tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 37740 

tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg 3 7800 

tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 37860 

ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 3792 0 

tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 3798 0 

ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 38040 

ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 3 8100 

gccagggcac agccctcaaa atctccatca atgacatgta agaaaagaga ggaacctggg 38160 

aaatagcaaa gtgccttttg cacattaaat ggttagctat atcccacaat actgtgcatt 3 822 0 

cgtaaacgtt aatgctgcaa taaatacggc acttcacctt gggaagatct ggagttggct 3 82 80 

tatgagtgtg gaagggtgta gcgcatgagt ttttgtgaaa cactggaagg aggattgtgg 38340 

gaaatcaaat ggaaagttct caccccaggc gtggagaaga gtgggtcatg gccccagcag 3 8400 

tgagcccagg gaggtcagag acggaggtgt gtgtgtgggt gtgaccctgc gcagttccct 3 8460 

gccggctgta gttttttgca ttcgcttaat gtttctcgtg gaggaaattg tgcatgagca 3 852 0 

aatgtgaaac cgtgctgtgc tcaaattgtc ctaatacatc attgcattgg aacagattgg 38580 

ctttnttttt tttttttttt tttttttttt tttgaaatgg agtctcactc tgtcaccagc 3 8640 

ctggagtgca gtggcatgat cttggctcac tgcaaccttt gcctcctatg ttcaagtgat 3 8700 

tttcctgcct cagcctcctg agtaactggg attacagggc atgagccacc gcggccggcc 3 8760 

agatttgcat ttttgaaaca actgctaggc tgggcgcggt ggctcacacc tgtaatccca 3 8820 

gcactgtggg aggccgaggc aggtggatca cctgaggtca ggggttcgag accagcctgg 388 80 

ccaacatggt gaaaccccgt ctctactgaa tatacaaaaa tcagctgggt gtggtggcgg 3 8940 

gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccaggagg 3 9000 

cagaggttgc ggtgagccga gatcacacca ttgcactcca gcctgggcaa caagagcaaa 3 9060 

actccatctc aaaaaataaa aaatagaaaa acaagtgctg tagcggaagt gagcactttg 3 9120 

cggagtcagg cttgtgtggc ctgttccaca aatgatgtgc tcacggtggc ctcaggccca 39180 

cctggagtct gcagcatggg gcacaacagg ttcattagtg tagaattcca ggacaggcct 39240 

ggctcctaag cagccttctt ttacaaaaac tgcagagccc gcctgtatcg tagcactttg 3 9300 

ggaggccgaa gtgggtggat cacgaggtca ggagttcaag accagcctgg ccaacatggt 3 9360 
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gaaaccccat ctctactaaa tatacgaaaa ttagctgggt gtggtggcac gcgcctgtag 39420 

tcccagctac tcgggaggct gaggcagaat tgcttgaacc tgggaggtgg aggttgcagg 3 9480 

gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 3 954 0 

aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 3 9600 

tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 3 9660 

gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 3 972 0 

gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780 

tttctagtta cgttgtactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840 

ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900 

gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960 

ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 4 0 020 

tgagaaagct cctgggcgta cattgcctta gagtttcatt ggagctgcgt tcttttatgg 40 080 

tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 4 014 0 

ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200 

cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260 

tggtgtctaa attatcatcc aattggccga ggttttgttt tctattaatt gtttttattt 40320 

tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 4 03 80 

tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 4 044 0 

cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500 

gggactacag gcatacacta ccacatctgg ctgatttttt gtattttttt tttattgtag 40560 

agacccgcta tgttgcccag gctggtctca actcctggac tcaagccatc ctcccacctc 4 0620 

accctcccaa agtgctggga ttacaggcat gagccacaac acccagccat tttaattttt 40680 

tttttttttt ttgagatgga gtctcactct atcgcccagg ctggagtgca gtggcgtggt 40740 

atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc 40800 

ccgagtagct gggattacag gtgcccatca ctatgcctgg ctaatttttg tattttttag 40860 

cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg 40920 

cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt 4 0980 

tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt 41040 

ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg 41100 

tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag 41160 

cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca 4122 0 

gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa 41280 

ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340 

gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400 

gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 41460 

cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520 

tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 415 80 

tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 4164 0 

accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700 

tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760 

cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820 

agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880 

cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940 

gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000 

tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42 060 

attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120 

ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180 

gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240 

agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 423 00 

gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360 

cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420 

tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 424 80 

cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 42540 

aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600 
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ccagggtagc 

cagctagtgg 

tccacagctc 

cagatgaaag 

gtgtgtgtgt 

ccctttccat 

gtgtgtggtg 

ggcctgggga 

gagacagggt 

cagcctcggc 

tgtaggcgtg 

ttgatacaga 

gattctccca 

cctgttttat 

agcaccatcc 

ggcctgggtg 

ggaacttgga 

aggcttctgg 

tgtgtcggag 

aggcagcgct 

tgagagtcac 

cacactcacc 

gtaccgatcc 

cccccactca 

cacctgccct 

ccaaactgtc 

gcttgagtcc 

atcccttctc 

ggtgcccggc 

ctgttgtcct 

ggagagaccc 

ctctgctgtc 

ggtctccacc 

gtctccgcca 

tctgagccaa 

ccggacactg 

ggaagccatg 

cgcggagcga 

gccgtgtctt 

tccaatgggt 

ggaaagcaga 

gcagcgctgg 

accctgtgca 

ccacagggag 

gtcagagctt 

cagggaaaga 

ggaacgtttg 

gacgggagct 

tgtcacttac 

ctcattggta 

gggcggaagc 

cagccagggc 

ctgtcagaaa 

tcctgagggc 



tggaatcttg 
cttttaggaa 
tgtagtgatg 
tgatgatgtc 
gtgtgtgtgg 
ctgggggtgt 
tgtgtgtgtg 
agacgcttct 
ctcactctgt 
ctcccaggct 
cgtcaccata 
tggagtcttg 
cctcagtttc 
taactgctga 
taagatgatt 
gcaggagccc 
tgcagctgtc 
gctgtcctct 
gccccaaaag 
cgagcatctc 
tgagagacgt 
tgcccctacc 
ctttacccgc 
cccgcccctg 
cccctcacct 
tgtcctggtg 
tggaaggtgg 
gactgtcaat 
gggtggctga 
aatcgctctg 
ccacaccaat 
ctgccagagc 
agtgcccggg 
gtgctcagga 
ggccaaaacc 
ttctggacgt 

ggggtggtgc 

gggtaggagg 
ctgccgaatg 
ggctttccag 
aagccccaag 
ccgacgggag 
gaaatccctt 
ctcctgggct 
ctggttggtg 

gggagaccga 

gggacaggag 
acagccagca 
tggccacgtg 
ggagtgacgg 
cccgggtcta 
tccggggtcc 
ttaccagaag 
tgaagcccga 



cttgtctttc 
ggaatttgtt 
tggatgttgt 
tttgcagctg 
tgtgtgtgtg 
gtgtgtgtgg 
tatgggggag 
ttttatctac 
cacccaggct 
gaagcgatcc 
ctgagctatt 
ctatgttgcc 
ccgacattct 
aagacctaga 
ccctctgact 
aggtgctcca 
tttcagggta 
cctcctacgt 
cccaggctca 
ctggacggtg 
ggctgccctg 
caccctcacc 
accttcccta 
caccctcacc 
ggcctccttc 
ccctgcagag 
aggagacagg 

ggggccttca 

ggaggcctaa 
gcatactcag 
actgggattt 
tctccagcca 
ggtctccgcc 
gtcttggttt 
cagacaggca 
gcgaggccac 
tgcgtgggga 
ccaacgggtg 
ccagcctctc 
ggtcccaaaa 
ggctggaagg 
ttgcagttga 
ccacggcatg 
tcttcctccc 
ggaaggcacc 
aacccatgtg 
cgcgatggcc 
caggtcctgg 
accttgggcg 
ccacagtggt 
taaggcggct 
ttagcagctg 
ccaggtggct 
gatcgaggtg 



agcccggcag 

cagttggctt 

tatctacaaa 

cccagcaagg 

gtgtgtgtgt 

ggtgtgtgtg 

gcaccctttc 

ttagagactt 

ggggtatggt 

tcccacctca 

gttttttttg 

cagactagtc 

gggatcacag 

taaagaaagt 

caatggagag 

ggccccattt 

acccaaagga 

cagctcagta 

tccctgagat 

accccagaga 

ccttcccaag 

tggcccccag 

cccaccctca 

tgtcccccac 

cgttggggaa 

aaaacagtac 

gatgtgttgg 

tgggagcgcc 

agtccgaggc 

gcgggcacgt 

gactttcagg 

gtgcccaggg 

agtgccaggg 

ctttgtctta 

gccccacgac 

caataccatc 

ccgcgacaag 

ggtgggggtg 

acaggctggg 

gcaaacacag 

gaagggggag 

taggttttgt 

ggggctgcct 

agaggccccc 

caggaccttg 

acatgaaact 

ttcctcagat 

gccgggagcc 

ggtggcatag 

gcggcctctg 

gtgcaggagc 

tggggggcct 

gagagtaatg 

tgggcagggc 



ctgtaccagt 

tgacacatgg 

gacacatgat 

ctgtgtgtgt 

gtgtatgggg 

tgtgtgtgcg 

catctgggtc 

tgttttattt 

gatatgagca 

gccttctgaa 

tttggttggt 

tcaaactcct 

gtgtgagcca 

ctgaaaagac 

ggaggggagc 

gccccaggcc 

accagatccc 

agagcccttc 

gcacagggtg 

gtgtggagac 

aggggctctg 

cctcacctac 

cctcccctgt 

cttcacctaa 

ggggttgtaa 

gtgagggccg 

gaagggcccc 

agtctagtga 

ggcaagagct 

agttaggagc 

ctaaacttga 

ctctccagcc 

gtctccgcca 

cagccctttg 

ctcagcatcg 

aacgtccaca 

cccagggcca 

ctgcccgtcc 

gagactttcc 

gctctttcac 

ctctgctgag 

atcatccttg 

gttgactcgc 

gacgctccca 

aggtctccag 

caggctccaa 

agctgggggg 

cagagattga 

cctcttggag 

cagcacacgg 

cagccgagct 

gcacctgttt 

gacacttgtt 

cctgcgccct 



ccagggtcca 
ccccctaggg 
ccttcgtgtc 
gtgtgtgtgt 
gagggaggca 
cgtgtgtgtg 
caagagactg 
gtattttttt 
tagctcactg 
tagctgggac 
ttaatttttt 
gaactcaagt 
ctgctgtctc 
ttactatcag 
ttttccttca 
aaatcactcg 
cgcaggcagt 
gaagggatgc 
ggctgggctt 
ggagagtcct 
agtcattccc 
ccccacatct 
accttcacct 
cccccaccct 

ggggcggccc 

cagtccaaaa 
atggtcttgg 
tgcacagctg 
cttccagagg 
tgattggaga 
gaagtgtggc 
agtgcccggg 
gtgcccaggg 
ttttgacctc 
acatctacag 
ggctgagcgg 
tcgtcgtcaa 
aggcgtgccc 
accctgggga 
agcccctcca 
aggttacaag 
ttaaacttga 
tcctgttcca 
cctgttggtc 
agagaaaagc 
actgagcacg 
ctggcatgaa 
gccctgactc 
actcagtttc 
ggggctcggt 
ggtctcccaa 
cccatggctg 
ctctcacagt 
ctgaaggctc 



42660 

42720 

42780 

42840 

42900 

42960 

43020 

43080 

43140 

43200 

43260 

43320 

43380 

43440 

43500 

43560 

43620 

43680 

43740 

43800 

43860 

43920 

43980 

44040 

44100 

44160 

44220 

44280 

44340 

44400 

44460 

44520 

44580 

44640 

44700 

44760 

44820 

44880 

44940 

45000 

45060 

45120 

45180 

45240 

45300 

45360 

45420 

45480 

45540 

45600 

45660 

45720 

45780 

45840 
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tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900 

ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960 

agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt 46020 

gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 4 6080 

gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 46140 

aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 4 6200 

ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 46260 

ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 46320 

caccaccggc ctcatccgcc ctgtggccct ggtggtggac aacacactgg gcaagctgtt 463 80 

ctgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 46440 

ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500 

tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 4 6560 

tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 46620 

aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 46680 

actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 46740 

ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 4 6 800 

catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860 

ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 46920 

gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980 

tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 47 040 

tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 47220 

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280 

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 4 734 0 

tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400 

atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 47460 

gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gttttctttt gccgcagaac 47520 

aaatcaccac gaacagcgtt ttaagacaac accaactctt tttttttttt ttttttttga 47580 

gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47 640 

gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700 

ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760 

gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa 47 82 0 

agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47 880 

tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 4 7940 

ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48 000 

agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060 

ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48120 

ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct 48180 

gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240 

cctfctgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 4 83 00 

tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360 

ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420 

gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480 

gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 4 8540 

cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 48 600 

aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 4 8660 

caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 4 8720 

tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780 

actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 4 8 840 

tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900 

gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960 

cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020 

ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 49080 
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attttatttt 
tgagatggag 
tgcaacctct 
attacaggca 
ttcgccattt 
cctcccaaag 
aattcccatg 
ttaaagcaac 
gttgggaggg 
gtgctgtggg 
catgtgtcca 
tataaggacg 
tgagatcctt 
gccttaagac 
acgggaagag 
agcgggagga 
atttctacaa 
ctgtaatccc 
ccatcctggc 
gcgcagtggc 
ggtcaagaga 
aagagatgga 
aattagccgg 
agaatggcgt 
cactccagcc 
agccaggcac 
tggcatgaac 
ctgggcgata 
gtagtcccag 
ctacagtgag 
ctcaaagtaa 
agtagagact 
ggcatccatt 
aataagtggg 
tcttatttgg 
tgtttgtctt 
tcccacatct 
gtgctcctgc 
tgatggacag 
acagtccctg 
ttctactatg 
acaaattttg 
tgacatcttt 
tttccagctg 
ttttatatga 
ttgtgtcttt 
gtcactcagg 
ttcaagtgat 
tacccagcta 
tctcgaactc 
gcgtgagcca 
cttggtgaat 
tgttccagaa 
cttccttcct 



attttatttt 
tttcgttttt 
gcctcccagg 
cccaccacca 
tgcccaggct 
tgctgggatt 
tgtattagtg 
agaaatggat 
ctgtgcttcc 
tgccagcgtt 
tctcctttgt 
tttgtcattg 
aacctgcaaa 
atgggtgtat 
ggccaggtgt 
tcacttgagc 
aaagtaaaaa 
agcactttgg 
taacacggtg 
tcccgcctgt 
tcgagaccat 
gaccatcctg 
gcatggtagc 
gaacccggga 
tgggtgacag 
agtggcaggt 
ccgggaggtg 
gagcaagact 
ctactcaaga 
ccatgatcgc 
gtaaatagga 
ggtgcagaca 
tgtgtgtcaa 
aggatcctcc 
cccctaccct 
gttttgtctt 
gtattgccaa 
agaacctgct 
gtacctgatt 
tggcatccag 
tgcattcggt 
ttacattttc 
ggctcatgtg 
tctctctgca 
tttctgttat 
ttgttttgtt 
ctggagtgca 
cctcttgcct 
atttttgtat 
ctgacctcgt 
ctgtgcctgg 
gttctgtgta 
acatcaatta 
gactgcctgc 



attttatttt 
gtcgcccagg 
ttcaagtgat 
taccaggcta 
ggtcttcaac 
acaggcatga 
tcccacggct 
tcccccaatg 
tctgggggct 
ccacacttgt 
gtctgcgtct 
ggtttagggc 
gacccttttt 
ctttctgggg 
ggtagttcac 
ccaggagttt 
aaaaaaaaaa 
gaggctgagg 
aaaccccgtc 
aatcccagca 
cttggctaac 
gctaacacgg 
gggcgcctgt 

ggcggagctt 

agtgagactc 
gcctattgtc 
gagtttgcag 
ctgtctcaaa 
ggctgaggca 
accactgccc 
ggagagacaa 
ggctggcctg 
tttccgtgtc 
cagagaggcc 
ccaggctgtg 
tgcagcagcc 
gggtgatggg 
gacctgtgga 
ctctgcctgc 
tttccaaagc 
gctatgaatt 
attttcaggt 
ttatttagaa 
actgatttct 
tttaaatgtg 
ttgcttcgtt 
atggcgcgat 
cagcctcctg 
ttttagtaga 
gatccgccca 
ccattaggtg 
ctcttgaaaa 
gatccagttg 
ttgagctgtc 



atttatttat 
ctggagtgca 
tctcctacct 
atttttttgt 
tcctagcctc 
gccaccacgc 
gctgtaacaa 
tgctggagac 
ccagggagga 
ggatgcgccg 
ttacctcttc 
ccacccaaat 
ccaaaaaaag 
ggcactatcc 
gcctgtaatc 
acaagtagct 
aaaaaaaaag 
caggcagatc 
tctactaaaa 
ctttgggagg 
acggtgaaac 
tgaaaccccg 
agtcccagct 
gcggtgagcc 
cgtctcaaaa 
ccagctactt 
tgagccgaga 
aaaaaaagcc 
ggagggttgt 
tccagcctgg 
gtgggcagtt 
tgatgtcaag 
agtaggaaga 
gggcctgcag 
gttctgaggt 
cacccatgtg 
acaccacggt 
ggtaggtgtg 
taggctgctg 
tgattgtgtc 
ttcctctaag 
agtttgaata 
gtgttgctta 
aatttaattc 
ttgggtgtgg 
tgttttgttt 
ctcagctcac 
agtagctggg 
gacggggttt 
cctcggcctc 
tgttttatca 
gaatgtggat 
gttaatagtg 
agttattgac 



ttatttattt 
gtggcgcgat 
cagccttcca 
atttttagta 
aggtgatcca 
ctggcaccat 
atgaccacaa 
cagaagcctg 
tctatttgtt 
cctcaacctc 
ttcttgtctg 
catccgagat 
gttatgctca 
aaccccttat 
tcagcacttt 
aggcaacatg 
ccaggtgtgg 
acgaggtcag 
atacaaaaaa 
ccgaggtggg 
cccatcaaga 
tctctactaa 
gctcgggagg 
gagatcgctc 
aaaaaaaaaa 
gggaggctaa 
tcatgccact 
aggcatggtg 
tcgacccacg 
gtgacagagt 
cagactgatg 
caacttctgt 
ctctgtaggc 
gagggccagt 

ggg a g aca g a 

cccgtgacaa 
gctcatgccc 
acctaggtgc 
cctggcatcc 
ttcctttgcc 
tactgcgttt 
tttttacact 
gtttctaaag 
tactgtagtc 
tgtttttgtt 
ttgagacagt 
cgcaacctct 
attacaggtg 
caccatgttg 
ccaaagtgct 
cccagcatca 
tctgctgttg 
ctcatcaggt 

a ggggtgtgg 



atttatttat 
ctcggttcac 
agtaactggg 
gagacgggtt 
cgcacctcgg 
ttgctatttt 
actggatggc 
cgaccaaact 
ggcccttcca 
tgcccatctt 
tgttgcctct 
gacctcgtct 
cagattctag 
acaatgaaag 
aggaagctga 
atgagacccc 
tggctcacac 
gagattgaga 
ttatggccgg 
tgaattacaa 
tcacaaggtc 
aaatacaaaa 
ctgaggcagg 
catgccattg 
aaagaaaatt 
ggcaggagaa 
gcgctccagc 
gtgcatgcct 
gagatcaagg 
gtgaccctgt 
gtatgggcac 
aactgtttcc 
tgccaagagg 
tctcatgagt 
gcctgacctc 
tggtggctgc 
agtccacctc 
tcctttgggg 
ttttaaaatc 
ctcctttctt 
cctgcatctc 
tctcctgaga 
agttggggct 
tgagagctta 
gttattgttt 
gtcttgctct 
gcctcccggg 
cacgccacca 
gtcaggctgg 
gggattatag 
tgcagtttat 
ttgggtggag 
tgtctctatc 
agtctccaac 



49140 
49200 
49260 
49320 
49380 
49440 
4950O 
49560 
49620 
49680 
49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
50760 
50820 
50880 
50940 
51000 
51060 
51120 
51180 
51240 
51300 
51360 
51420 
51480 
51540 
51600 
51660 
51720 
51780 
51840 
51900 
51960 
52020 
52080 
52140 
52200 
52260 
52320 
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tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380 

tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 52440 

aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500 

tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560 

gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620 

ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680 

gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 52740 

atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800 

gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 52860 

cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 5292 0 

aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 5298 0 

cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 53040 

tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 5310 0 

acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160 

cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220 

gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280 

agcagagggg acattttgtg actgtccccc tcctgagctt cccagcagct tttctccaag 53340 

ttacagccca aaagctcagg tggatttgca acccaacggt gtctgtgcac ctcccactga 534 0 0 

tgcccgaact gccctggcca agaaacgggg ccgtcagaac gctgcactaa ctgcagcctt 53460 

gggcctccat gccagaggcc atgcccttcc atccaccacc ccctggcctg ggccctggcc 53520 

ctcctggctc gggaactcca ggccccttcc tcacggatcg agagacgtgt atttaccgca 53580 

caggtgcttg tcattctctt gtggcctctt ctccagggag atcacagaag gacagggcct 5364 0 

cactgaggtc tcggacatgg accctttgat agtggcagga gccaggctgg gcaagaggcg 53700 

gccacagtca cctcagcagt gccatcacca ccgccattca gcccttccct gagccgggcg 53760 

cgcccctggc tctggcccca gtgtcccagt tacagctcac aggagcttgt ggtgcccagc 53820 

ggctgcttct gattgagagt cgaggtcgga ggctttggga ggctgagagg ctgctcggtt 53880 

tcacaactgc tgagggagac ttgggctcca tctcaggtct gccccatgtc gccctcaacc 53940 

tccagccacc ggtcctccgt gtcccccatg gccaggcacg gcttgcagac atctgtcgtt 54000 

ggctcctctc agccgtcgtg ggctgaccct ggcacgtcct cctgtggctg agcccagtgg 54060 

ggacagctgc ttccttttat taccctagaa ctctcgtctt tgatcaggcc ccctccccta 5412 0 

tgccacacag tccctgtcac tcgggtgagc ccagtagtca tggggaaggc ctgcgggttc 5418 0 

caaacatcca aaggcttgcg tgcagcatga cagcttgaaa ccgatgtttt ttaccttgat 54240 

cagatttcag cttggcgggg gctttgctca gctttcagtg aggcctgggc cgatttccca 54300 

gcatcccctc ctgaggccag cctctgtttc ctgtgatttt ctgcacaaag tgggagggag 54360 

gagtcttagg aaatgggggg ccacctcgaa acctaggcct cctctggctt ctctgtgcca 54420 

gtgcccccac gctttgtgtc tgtgtcccca gcccatggga ctgtgttatt ccctgagtgc 54480 

tgccgcatgc ccagcccgca ctgaggacgt ggagccccga ggggcaggat ggcctccatg 54540 

gtcacacgta ggaagtggcc tccaccctcc gatgatcctc tccccccctc cctttcagcg 54600 

ccttccccgg gggtgtcatc agccctcctg cctgtgcttt gtcccgtctt ctgcaggcgc 54660 

atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 5472 0 

cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 547 80 

ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 5484 0 

agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900 

cggagcattt ctgctgtatt tggtgtagcg cctgctgctt aaagctctga ttcccagttg 54960 

gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020 

tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080 

gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140 

tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200 

gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260 

gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 5532 0 

tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380 

cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440 

tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 5550 0 

aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560 
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aggggctctg 
aggaaaactc 
tgtgagggtc 
attcaggggt 
cagtgctgtc 
acatgtagaa 
gcaggatctt 
ttctttcatt 
tgtcacccag 
gttcaatcga 
atgcctggct 
gtctcgaact 
caggcatgag 
ttgcctgcca 
tcgctctccc 
acgtcctcat 
ctgcatgttt 
aattttatgg 
cctgccacct 
ttgcacaatg 
agacctacgg 
cagccctgct 
ccgtggttag 
ggccgggtag 
gtatggcacc 
gaagtcgaag 
agagagagag 
ttaagtaacc 
caccaggcat 
ggtcatattt 
tattcctatt 
gcatgatctt 
cctcccgagt 
caatagagat 
atccacttac 
gagggtcaca 
gtcccgctct 
gttctggctg 
cccatccatt 
cagggcaggt 
ggtgggcgtc 
ggccagagac 
tgctccctcc 
actctcggga 
acctggtaga 
aggcacctga 
gcagtccctc 
tgtactagac 
agtccagacc 
tgagtgcatg 
gccagtgtgt 
acgagctcat 
ccccaggaac 
tgggagactc 



aaaacctttg 
tgcagttcca 
tatctcacct 
gggctgcaga 
tccactgacc 
tcaggaggta 
cctttagatc 
cattcattca 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgacctca 
ccaccgcgcc 
catgttgtca 
agagccaccg 
tcacaaagag 
ttctataaag 
aggctgtaga 
acttttgtaa 
tctgtggctg 
ccttcaaagc 
gcgcatcctg 
gaattgtccc 
attatgaagg 
cgcatctgct 
cgggagcagg 
agagcgcctc 
agctcccatg 
ctgctcccat 
tattctgttc 
ctattatttg 
ggctcactgc 
agctgggatt 
ggggtttcac 
ctcggcctcc 
tttccgttga 
agcttctcct 
cacgctctgt 
acaaggacag 

ggggaactga 

tggggagggg 
tgtggccttg 
cctcccctcc 
ggcgaggctc 
gccttgggcg 
cacatacctc 
ccctgcgtgg 
cactccccgc 
ttggttgctg 
gcctctcctt 
cctcatcaaa 
gtgtggtgag 
gtggagttta 
aggcggctgg 



cttgcaggga 
cccatcctgt 
tcctgcatta 
ggtccatggc 
agtctgtggg 
taaagagatt 
ttgacagtga 
ttcattcatt 
cctggtgtaa 
tcagcctccc 
tttttagtag 
ggtgatccgc 
cggcctgagt 
gcaccatggg 
gctctgttag 
cagacatttc 
gcatagtggt 
ctaggggctg 
ataaagtttt 
ctttcatgcc 
cccagacctc 
gcccagcatg 
tgtgttggtc 
aaagaggtct 
cagcttctag 
tgcatcacat 
tccctcttgc 
tgaactcaca 
gacccaaaca 
tatgctatgc 
agacagaatc 
aacctccacc 
acaggcacac 
catgttggcc 
caaagtgcca 
gatttggagg 
cccgtgtgcc 
tccagaagct 
ggtcccctta 
gcaagttctc 
tggaggcagc 
tgctgctccc 
cctcaagctc 
ctcgtggcca 
gggtctgtta 
ctggggggca 
cataggcctc 
tggtcctaga 
tgccctgcat 
gcagccatct 
cagcagtgcg 
ccagcttctg 
ggggaggaga 
gaggctcctt 



gattgccaag 
cccaccaggt 
ggtcaggagt 
tcacaccctg 
atagtgctta 
tgctcaggca 
aacacatctc 
cattcattca 
tctcggctca 
gagtagctgg 
agacagggtt 
ccgcctcagc 
tttcctttta 
cccaggactg 
ataattcaca 
gtagaagatg 
taagtccttc 
gtaaactaag 
cttggtgcac 
aaaagcagga 
acgtctggcc 
tgctgtgtgt 
cattttgcat 
gtctggctca 
tgaggtctca 
ggtgagagag 
cctcaccttg 
gtgagagccc 
ctgcccacca 
tatgctatgc 
tcgctctgtt 
tcccaggttc 
accaccacac 
aggctggtct 
tgattacaga 
ggcagacgtt 
ccgcggtgct 
ttcttccctg 
tctcccattc 
tgggggcagg 
cgtcagcgcc 
gtgtgggctg 
tgctgagcac 
ttcctgtcct 
ctccttgcat 
ggcaggaggt 
gccacagggt 
aagggtccca 

ggtgggctgg 

gcctgcccaa 
actccttccc 
gcacggggaa 
cgtgcctttc 
gcgggaggca 



tctgtctttt 
agtgtggctt 
ttcacagaaa 
gaaaatccgc 
agcctgagtg 
tcctgggccc 
ttctgtgccc 
ttcgagacag 
ctgcaacctc 
gatgacaggt 
tcaccatgtt 
ctcccaaagt 
tgaaggacct 
ctgaggagct 
tgcagtctgg 
agggcctggg 
cagctcattg 
ggcccagggg 
agccatgccc 
gaactgagtg 
cttgacagac 
gtgatttcag 
tgctatgaag 
tggttctgta 
ggaagctttg 
ggagcaacgg 
agaggagatg 
atttgctact 
ggccctacct 
catgccatgc 
gcccaggctg 
aagcgattct 
ccgggtaatt 
caaactcctg 
tgtgagtcac 
ggagccatct 
ggtggcaggc 
cttggttacc 
ccagggcagg 
cgtggctatg 
ctggcttgct 
cctgcacctc 
cactgccttc 
tggcacccac 
ggcgtagacc 
gcgttgaggt 
catcgagggt 
tctgtctgct 

ggggcaccct 

ccagttccgg 
cgactgtatc 
ggggcgtccg 
cagcggggct 
gggaagcctt 



aggcccaaca 
gaaggcagac 
cctgaggcac 
ccccaaaaga 
gtttctatca 
tctctgacca 
cctgtgagtt 
agtcttgctc 
tgcctccagg 
gcgcaccacc 
ggccaggctg 
gctgggatta 
gcttggttgg 
gttgatgccc 
ccactgtcct 
agtaacctcc 
accattggag 
ccaaatccag 
attcattcat 
gttatgctgg 
agagcttccc 
cttgcaggag 
gagcacctga 
ggcagcacca 
actcatggtg 
agagagagag 
ccaggctcct 
gcggagaggg 
ccaaccttgg 
catgccatgc 
gagtgcagtg 
cctgcctcag 
tttgtatttt 
gcctcaagtg 
tgcgcccagt 
gagccccctc 
ccttacgccg 
agaaaatcat 
acaccggggg 
gctccctctg 
cttcctccct 
cagtgggttg 
cacagccccc 
ccccccacca 
tccccacagt 
ctcagccctg 

gggtggagac 

ctctgtttgg 
ccagcctctc 
tgtgcgagcg 
gacggctccg 
ggctgggttc 
gggggctgtg 
tcccagggca 



55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
5670O 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
58560 
58620 
58680 
58740 
58800 
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gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 5 8860 

gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920 

agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980 

actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 5904 0 

tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59100 

gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 5 9160 

aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220 

ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280 

cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 5 934 0 

tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400 

gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 5 9460 

gggccccagc tgtgggggcg ctgfctgatgc gcagccaggc ctcgccgcca gagcccgcac 59520 

gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580 

gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 5964 0 

tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700 

gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760 

gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 5 982 0 

gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 5 9880 

ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 5 994 0 

agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000 

gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 6006 0 

ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 6012 0 

gcagtgccat cgggcccgtc attggcatca tcctctctct cttcgtcatg ggtggtgtct 60180 

attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 6024 0 

acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 603 00 

agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag 6036 0 

a 9tagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 60420 

a tgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 6 04 80 

gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 60540 

taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600 

tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660 

gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 6072 0 

gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780 

tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60 840 

ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900 

accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960 

tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020 

attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080 

ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61140 

ctgtgctgcc ccgcctgggt tccctgcccc agtcgcaggc cccttggagg aggtaccatg 612 0 0 

tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260 

gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320 

ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 613 80 

ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440 

ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 6150 0 

a ggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560 

cggggccggg g a ggggcggg gcgggatggg gctgtgggcc cctcccaccg tcagtgctgg 61620 

ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680 

tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740 

tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800 

ctgttgggct ccgtcagcat ccaggcggcc gtctgtgttc cgcacggccc agcccatcga 61860 

tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920 

aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980 

gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 6204 0 
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?39gtgagggg ctggctccag ctgagccacg ccttgagtga aatgactgtg aggagaataa 62100 

actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160 

acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220 

cgggggctga cttgacatgc tttgtatgct tttcatttgt tcctgcagct gtatgcccct 62280 

aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340 

ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 624 00 

ttatttttct tttttttttg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460 

ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 6252 0 

cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580 

tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62 64 0 

tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700 

gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 62760 

ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62 82 0 

cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880 

tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctggt ctccaactcc 62 940 

tggactcgag cgatcctcct gcctcggcct ccccaggtgc tgggattaca ggcgtgagcc 63 000 

accgtgcctg gcctggggta ttgtcttctt atggcacctg actgtggtgg gccctgggaa 63060 

ggaagtagca gaagagggtt cttcttggtt tcctggacag taactgagtg ttctggaggc 6312 0 

cccagggcct ggctttgttt agggacaaag ggaactggta accagaagcc gagagtttaa 63180 

acacccactg cccttcttcc ctgctcctgc tgctgcaacc cagcttaacc agccaggagt 63240 

gctaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaat tctcttttcc 63 3 00 

tgttctccct tgggagctgg gaggatctta atcaggcaat aagagatggc actgagcagc 633 60 

cagctaattt tttaaatcac tttattgttt aaccatatga ctcacccact taaaaaaggg 63420 

tacagttcag tgggttttag tgtattcaca gatgtgtgca accctcacca cagttaattt 63480 

tagaacattt tcctgcccct aaaagaaact ctgcatgaag ccagctgttt ttaaattagc 63540 

aaagttattt tgcatccttt aaatatatgt tcatggtaca aaattcaaaa gatacagaag 63 600 

agtctgcagt ccaaagagac tccgccccca tgacgccaag caggactccc tgggaggcat 63 660 

ggcctcctgc agtgtgtttc ttctatgtcc ccccaggggt catctgtaca tatgcaagca 63 72 0 

tacaagagcg tggactttgt tttccaagcc agaagataat tgtagattta tgtgcagttg 63 780 

tgagaaagag cacagaccca tttatcctct gcctggtttc ccccagtgct gcctgccatc 63 840 

ttgcatgact tccattccta tcataagcaa gacactgata acgattcttt caccttattc 63 900 

agattgacat aagtgttttt tgtttgttct tgagacaaac ttcctctgtc acccagtggg 63 960 

agtgcagtgg cacaatcaca gctcactgca gcctcaaact cctgggctca agcgattctc 6402 0 

ctgcctcagt cccctcaagt agctcagatg gcaggtgtgc accatcatgc caggctaatt 64 08 0 

tttaaatttt ttgtggaggt gaggcctcac taaatttcct gggctagtct tgaactcctg 64140 

agctaaagtg atcctcctgc ctcagcctcc caaagtggta ggattacagg catgagccac 642 0 0 

tgcgcctggg ctgacatatg tgttttcgta agcccgaaag atagcatctg aagagtcaac 64260 

attgagcctt gccttttgct gctaatgatg tataaaagct gctgttctga gcatttcgga 64320 

ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 643 8 0 

ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa gaactgagca tgaaccactg 64440 

gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 645 00 

actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560 

tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64 62 0 

tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64680 

tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64 74 0 

ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64 800 

atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 64 860 

cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64 92 0 

ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64 980 

atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 6504 0 

gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65100 

gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160 

tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220 

ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280 
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cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 6534 0 

gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 654 0 0 

taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 65460 

tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 6552 0 

gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgccc 655 8 0 

agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640 

ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 6570 0 

ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760 

tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65 82 0 

cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65 8 80 

ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 6594 0 

gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 660 0 0 

ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060 

tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 6612 0 

caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 6618 0 

9999cctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240 

aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 663 00 

gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 663 6 0 

taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 664 2 0 

tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 664 8 0 

aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc aaatcctgag 6654 0 

gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 6660 0 

tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660 

atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 6672 0 

tttatttaat tttaaattta ttttttgaga cagagtctca ctctgttgcc caggctggag 667 8 0 

tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 6684 0 

acctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacccg gctaattttc 66900 

atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960 

tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 6702 0 

agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080 

aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgttcttgta ccaccctagg 6714 0 

gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 672 0 0 

9999ctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 672 60 

accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 673 2 0 

gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 673 8 0 

tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 6744 0 

cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 675 0 0 

actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560 

cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67 62 0 

99 a 999gcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 6768 0 

gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 6774 0 

gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 678 0 0 

cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860 

gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67 92 0 

tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980 

aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68 04 0 

tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100 

tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 6816 0 

aggtaggaca tcccctgcag ccctccatgg ccattgggtfc cccgccagcc cgtggtggag 6822 0 

gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280 

gggcccagcc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 6834 0 

tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 684 0 0 

tcaagttgct cctatggggg ttacttttaa gctgggaaat ggctgtggcg tcgaggggcc 68460 

99999Cttgg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 6852 0 
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tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580 

tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 6864 0 

cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700 

ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 68760 

tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820 

tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 6 8880 

cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 6894 0 

gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000 

cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060 

gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 6912 0 

cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 6918 0 

gaatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240 

tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 693 00 

ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac 69360 

acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 6942 0 

cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 694 80 

acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 6954 0 

gttgcttgaa ctcaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600 

cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660 

actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 6972 0 

agtcacatct tacatggctt cagggaacag acagcatgag aaccaagcga aaggggtttc 69780 

cccttgtaaa accatcaagt ctagtgagat ttattcacta ccacgagaac agtatggggg 6984 0 

gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900 

tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 6996 0 

acatttacag taacagtgtt aggtgaacag ttgtccagtc tcctgttttg tcggacactg 7002 0 

tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080 

catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 7 014 0 

aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 7 0200 

ggcttggcag gctgcagggc aggtgcgggc atgcccatgg tagaggtggg accattgagg 702 60 

ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 7032 0 

gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380 

gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 7 044 0 

ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgfcgacag 7050 0 

cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 7 0560 

agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70 62 0 

cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680 

ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740 

agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatataattg 70800 

ggattttaaa aacatgagaa atgtgaactg tgatggggtg ggcagggctg ggagaacttt 7086 0 

gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920 

ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980 

gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 7104 0 

ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71100 

tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71160 

cgtcttcctc agattcttag tctttttttt tttttttttt ttttgagacg gagtctttct 7122 0 

ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 712 80 

ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340 

cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 71400 

ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460 

ttataggcat gagccactgt acccagctga ctcttagtca cttttaagaa ggggactgtg 71520 

ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580 

gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640 

aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 71700 

ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 71760 
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acctgacgcc 
actgcctctg 
ggaagctgag 
gggagaaaat 
gaggtcagaa 



tgctgtgggc 
cagggtcccc 
ctgttctcgt 
gcacatcgca 
aacagtcaag 



ccatccctgc ggtgctggag aagacagaca aggcctggtc 71820 

agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880 

gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940 

tcagtggaga agggcctgac ttccctcagc atggtggagg 72 00 0 

cttgagtatt ctatagtgtc acctaaata 72049 



<210> 10 

<211> 8705 

<212> DNA 

<213> Homo sapiens 



<400> 10 

ggactcaggg 

agagggtgaa 

ctaattaaag 

gcccaacagg 

cgtacatcag 

ggggagtaca 

tcagacccgg 

gccgagatcc 

tgccgaggcg 

atctcaaact 

gtggaaacag 

ccctgcctgc 

acgttggggt 

tccagaaaga 

cccagaggct 

tttattactt 

actctgttgc 

cagggttcaa 

caccacaccc 

gctggtctcg 

tcaagcccat 

tgctgcagcc 

gtgttcttgt 

ttctgttccc 

ggagacccac 

gtggcacttg 

tcctcccttt 

aatttgtcta 

ggtgcctttg 

ggcttcagca 

agactgtgtg 

aaggtccctg 

ggacatctcc 

cttcacagcc 

aaaagtgcct 

cctcccctcc 

ggggccagaa 

ccaggctggc 

ccctccccgg 

ccggccactg 

tcccgccagc 

gggtgctaat 



gcagcaggga 
tcgatggaac 
agacggggca 
gaggatatgc 
tggagatatg 
gcaccagcca 
atccaccctg 
agtcctcttt 
ctgcattggt 
gccttgggat 
accccatagg 
tctctgcttt 
caaatcctga 
gcagggccca 
actcaccagc 
tattttatta 
ccaggctgga 
gggattctcc 
ggctaatttt 
aactcctgac 
tttaaagttg 
agaaggtggt 
accaccctag 
attattccgc 
ataagcccta 
tggaaggtgc 
gtagggctgg 
tttactatcc 
tctggagtgg 
gggacgcaga 
tgatggaggg 
aggagcctgg 
tactgctggg 
cagcatctgg 
gggttccctt 
tgtccccagg 
gccctctctg 
gaggctctaa 
ccacggaccc 
cgagaccgta 
ccgtggtgga 
gggcaggtgc 



ggtacaccca 
ctgggggaca 
gtggacaggc 
gtacagaatc 
ggtcccccga 
tcagcaggcc 
accgacgtcc 
gaatagggcc 
ggatcttgcc 
gtaggagaga 
ctggggcttg 
gaagcatctt 
ggaataccca 
gtgcggcttc 
catcaggata 
ttttatttaa 
gtgcagtggt 
cacctcagcc 
catattttta 
ctcaagtgat 
aagaaactga 
aaaacaggac 
ggggcctgag 
aggggctcgg 
caccctttgg 
agacctgtgt 
ttgtgtgctg 
ccgctttggg 
gactgtggcc 
gcccctgagt 
aggaggggca 
tgaggtgcta 
ggccagccac 
tcacaaggca 
tgcctttcac 
ttttgccaac 
caaggaaagc 
gtcaccctgg 
ctccctgtac 
caggtaggac 

ggggcctaat 

cgggcccagc 



tggttagtgg 

caatcgaagt 

attttcagtt 

atactcgatc 

tgactccgtg 

agtaaatcat 

caagccccca 

tggccgtggt 

agaagtttgc 

aaccaggcct 

ggcagctgta 

ccccagtgcc 

gactggctct 

ctaaagggca 

gccccaggaa 

ttttaaattt 

gcgatctcaa 

tcccaagtag 

gtagagatga 

ccgcccgcct 

ggctgaggta 

ttcacccggg 

agaactgagt 

atctggctgt 

gaagctgcat 

gggtgtgtgg 

cctggacctg 

gctggtgcca 

cctccctcag 

gttcggaaca 

ggaggaaagg 

actggtgtgg 

aaagggaact 

ggtacttgga 

tgagatgacc 

tggccagagg 

ccgaggggtg 

cttggctctc 

aacatggaca 

atcccctgca 

ccccatgcca 

cctgcctccc 



gcggaccata 
ggttccagag 
gactgcccag 
agcatgagtc 
gaacactgat 
accggcctgc 
ccccccaccc 
tcacgggaca 
ccagtgcaga 
ggtcaagttc 
ggaagccctc 
cccagtctca 
ctgggccaaa 
ggggaagggc 
gcaggccttc 
attttttgag 
cccactgcag 
ctgggattac 
ggtttcacca 
cggcctccca 
aattccctcc 
tctgtctggc 
ccctcgggca 
atgctttcca 
gttgggttgg 
gcccagggcc 

gggggcacgt 

gcacaggccc 
cgtggtgact 
agggcgtcat 
tcagaaggag 
aggacactca 
ggccgaagtc 
agggcgcggg 
ttcggggcag 
aaggggtcct 
tgggaggaag 
ctcagatcct 
tgttctactc 
gccctccatg 
ctgatgaggg 
tctgctctgc 



gggggtaatg 

tcgggctgta 

ggagtgttct 

caattcagac 

gtttgtgaca 

gaaattggac 

cccaccatgg 

tctgagacat 

tttgggcaga 

atgggaagag 

tctgctgcct 

tgccctctca 

gaggaccctc 

ctggccactc 

tcgagcccat 

acagagtctc 

cctctgcctc 

aggtgcccgc 

tgttggccag 

aagtgctagg 

ccagggatcc 

gtgaaaggca 

taactgacag 

ggatggcctt 

ggtgccgtca 

cctggtccct 

tcacgtggtg 

ttgtgaaggg 

tctgtgtcag 

tgcaggagtt 

agttcctggg 

gggcctgtgg 

ctgtccccgc 

cacctgggcc 

gtggctgctg 

gggaagcagg 

gaaggaatgc 

gaacccgccg 

ttcaaacatt 

gccattgggt 

gaggtattct 

caaccacact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 
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aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580 

ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640 

tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700 

cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 2760 

ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2 820 

caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880 

ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 294 0 

ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3 0 00 

cgcaggggct ggatcagagc ctgggatggg cagggtgagc ctcctgacct ttaacccagt 3 060 

ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 312 0 

ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3180 

gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240 

acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300 

ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360 

ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 342 0 

ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480 

catcttcccc ccgcccgccg tctgacctca cagcagctgg gccccaagag gagtatcctg 3540 

tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600 

ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3 660 

gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 372 0 

aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780 

gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg 3 840 

taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3 900 

ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960 

gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4 02 0 

accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080 

tcccacagca cgtgggaatt atgggagtac aattcaagat gagatttggg tggggacaca 4140 

gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggtgaaca gttgtccagt 4200 

ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260 

ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 432 0 

ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380 

aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440 

gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500 

ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4 5 60 

tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4 62 0 

tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 468 0 

cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 474 0 

cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800 

cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4 860 

cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4920 

cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4980 

tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5 04 0 

gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 5100 

cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 516 0 

tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 522 0 

ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280 

agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 534 0 

cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400 

ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 5460 

tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 552 0 

tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580 

tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 564 0 

tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700 

cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 576 0 
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ggctctgggc 
gctgctgctc 
ctgctgagcc 
acaggaacag 
agaagacaga 
gtaatctagg 
gaacagccgt 
acttccctca 
tgcatttaat 
ttgggaggct 
atggcaaaac 
aatcccagct 
gcagtgagcc 
caaaaaaaaa 
atggaagacc 
ataataacat 
tggctcacgc 
acgagtttta 
aattagccag 
gagaatcact 
cccacctggg 
aaaaaacaag 
ctaggccagg 
gatcacctga 
ctaaaaatac 
ccccagctac 
agtgagccag 
aaaaaaaaaa 
gaaataataa 
tcaggagttt 
aaaaaaaaaa 
aggctgaggt 
caccctgcac 
agcagaaata 
cgaaaagata 
caaataaata 
aggatcatta 
ctgggtaaat 
ctgaacaggc 
tcaggagttc 
aaaaattagc 
aggaaaatca 
ctccagcctt 
aacagaccaa 
gcccaggaac 
gagtctcgct 
tatgcctctc 
gatggtactt 
gcttccaaca 
aaaag 



tcttctgaac 
ttcctgaggg 
cactctgcca 
tcagcaagcc 
caaggcctgg 
cattttcggt 
cttcagaggg 
gcatggtgga 
aatcaaaata 
gaggcaggtg 
ctcaactact 
acttgggagg 
gagattgtgc 
aaaaaaaaaa 
caaaacaaac 
tgactgtaaa 
ctgtaattcc 
agaccagcct 
gcggtagtgg 
ggaggctggg 
cgacagagtg 
acccaaccat 
tgtggtggct 
ggccaggagt 
aaaaaaacaa 
ttgggaggct 
gatggtgcca 
gaaaagaaaa 
agatcagaac 
gagaccagtc 
ataggctgga 
gggagaatca 
tctagctggg 
aatgaaatgg 
aacaaaatca 
aaatccgaga 
gaggcaacta 
ttctagacac 
cgggcacggt 
gagaccagcc 
cgggcgtggt 
cttgaacctg 
ggcgacagag 
taacaagtaa 
aaatggcttc 
ctgtcgccca 
aggctgcaag 
gtcaaacaag 
aaggttgggt 



ctgtgttggc 
caaggagccc 
caccacgtgc 
acacctgacg 
tcactgcctc 
ggggaagctg 
aagggagaaa 
gggaggtcag 
taggctgggt 
gatcacttga 
aaaatacaaa 
ctgaggcagg 
ca.ctgca.ctc 
gcaatacaaa 
agaaaacaac 
taggctaagc 
aacgctttgg 
ggccaacaag 
cacgcacctg 
aagcggaggt 
agatcctgtc 
ctcttgcata 
cacacctgta 
ttcagaccag 
aaaaaagaaa 
gaggcaggag 
ctgcactcca 
gaaaaacgag 
aggccaggct 
tggccaacat 
tgtggtgatg 
cttgagccca 
taactggagt 
aaatgaagga 
acaaaccttt 
ttaaaaagga 
tgtgcaacta 
atacaaccta 
ggcttacgcc 
tggccaacat 

ggcgggtgcc 
ggaggcatag 
ccagactcca 
tgcgatgaaa 
actgctgaat 
ggctggagtg 
tgattttcct 
gcctggctaa 
tacggctggc 



ttccatctga 
catgactgcg 
ccctggcagg 
cctgctgtgg 
tgcagggtcc 
agctgttctc 
atgcacatcg 
aaaacagtca 
acggtggctc 
ggccaggagt 
aactagccgg 
agaattgctt 
cagcctgggc 
atacaaatat 
aaaatggcag 
tctgcaatca 
gaggctgagg 
gtgaaacccc 
taatcccagc 
tgctgtgagc 
ttaagaaaaa 
caagaaacac 
atcccagccc 
cttgaccgac 
aaggctggaa 
aattgcttga 
gcctgggtga 
aaaaacaaac 
catgggcaca 
ggcaaaaccc 
tgtgtgtgcc 
ggaagtggag 
gagattctgt 
aacaatgcaa 
agccagatta 
gacattacca 
tatgctaatg 
tcaagattga 
tgtaatccca 
ggtgaaaccc 
tctaatgtca 
gttgcagcga 
tctcaaaaaa 
actgtaataa 
tttaccaaac 
cagtggtgta 
gcttcaggcc 
atttctatat 
ttctgaaaat 



cctctctgtg 
tgttgactcg 
gagggaatcc 
gcccatccct 
ccagtccgtg 
gtgtcctgaa 
catcagtgga 
agcttgttgc 
atgcctgtaa 
ttgagaccgg 
gcgtggtggt 
gaacctggga 
aacagagcaa 
cactttcact 
gagtaagtcc 
aaagagtggg 
cggatggatc 
atctgtacta 
tacttgtgag 
caagatggag 
aaagagtgga 
actttaccta 
tttgggaggc 
atggcaaaac 
gtagtgatgt 
atccgggaag 
cagagcgaga 
acaaaattag 
gtggctcaac 
catctctcct 
tgtagcccca 
gctgcagcga 
ctcaaaaaag 
aatgatacaa 
agaaaaaaag 
ctgataccac 
aactggaaaa 
accatgaaga 
gcactttgga 
catctctact 
gccactcggg 
gccgaggttg 
attaaaataa 
aatgtttccc 
attttttttt 
acctcggttc 
ccccgagtgg 
ttccttcaag 
cttggatttc 



ccagcccaag 
ctggatgggg 
ctgggtcctc 
gcggtgctgg 
gaaggagaca 
ggccaggcgg 
gaagggcctg 
tgggtgacag 
tcccagcact 
cctggccaac 
gcacgcctgt 
ggcggaggct 
gactctgtct 
aaaagaaggg 
ccacttatca 
ccaggagcgg 
atttgatgtc 
aaaatacaaa 
gctgaggcag 
ccactgcact 
tgaatggatc 
taaaaacaca 
ctgactggca 
cccatctctc 
gtgcctgtag 
tggaggttgc 
ccctgtcata 
tagaagaaaa 
tcctacctgc 
aaaaatatga 
gctacttggg 
gtcatgaatg 
caaagaccag 
aaagtttttt 
agagaagacc 
agaaattcaa 
cctagaagaa 
aatccaaaac 
aggcctgaga 
gaaaaaatat 
aggctgaggc 
caccactgca 
caaaaacctg 
aacaaagaaa 
ttttgagacg 
gctggtaact 
ctggaaatta 
tagaagatgt 
aaggctcccc 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
63 60 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8705 



<210> 11 
<211> 66933 
<212> DNA 
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<213> Homo sapiens 
<400> 11 

tataatcaag cgcgttccgt ccagtccggt gggaagattt tcgatatgct tcgtgatctg 60 

ctcaagaacg ttgatcttaa agggttcgag cctgatgtac gtattttgct taccaaatac 120 

agcaatagta atggctctca gtccccgtgg atggaggagc aaattcggga tgcctgggga 18 0 

agcatggttc taaaaaatgt tgtacgtgaa acggatgaag ttggtaaagg tcagatccgg 24 0 

atgagaactg tttttgaaca ggccattgat caacgctctt caactggtgc ctggagaaat 3 00 

gctctttcta tttgggaacc tgtctgcaat gaaattttcg atcgtctgat taaaccacgc 360 

tgggagatta gataatgaag cgtgcgcctg ttattccaaa acatacgctc aatactcaac 420 

cggttgaaga tacttcgtta tcgacaccag ctgccccgat ggtggattcg ttaattgcgc 48 0 

gcgtaggagt aatggctcgc ggtaatgcca ttactttgcc tgtatgtggt cgggatgtga 540 

agtttactct tgaagtgctc cggggtgata gtgttgagaa gacctctcgg gtatggtcag 600 

gtaatgaacg tgaccaggag ctgcttactg aggacgcact ggatgatctc atcccttctt 660 

ttctactgac tggtcaacag acaccggcgt tcggtcgaag agtatctggt gtcatagaaa 72 0 

ttgccgatgg gagtcgccgt cgtaaagctg ctgcacttac cgaaagtgat tatcgtgttc 780 

tggttggcga gctggatgat gagcagatgg ctgcattatc cagattgggt aacgattatc 840 

gcccaacaag tgcttatgaa cgtggtcagc gttatgcaag ccgattgcag aatgaatttg 90 0 

ctggaaatat ttctgcgctg gctgatgcgg aaaatatttc acgtaagatt attacccgct 960 

gtatcaacac cgccaaattg cctaaatcag ttgttgctct tttttctcac cccggtgaac 1020 

tatctgcccg gtcaggtgat gcacttcaaa aagcctttac agataaagag gaattactta 1080 

agcagcaggc atctaacctt catgagcaga aaaaagctgg ggtgatattt gaagctgaag 114 0 

aagttatcac tcttttaact tctgtgctta aaacgtcatc tgcatcaaga actagtttaa 1200 

gctcacgaca tcagtttgct cctggagcga cagtattgta taagggcgat aaaatggtgc 1260 

ttaacctgga caggtctcgt gttccaactg agtgtataga gaaaattgag gccattctta 13 2 0 

aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380 

tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 144 0 

ccactgttcc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500 

tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg fcctgattatt 1560 

agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 162 0 

ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 168 0 

ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 174 0 

accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800 

atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 1860 

cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 192 0 

tccatcaatg cctgtcaagg gcaagtattg acatgtcgtc gtaacctgta gaacggagta 1980 

acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2 04 0 

attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100 

ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160 

ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 222 0 

tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 22 8 0 

tccacgcacg tfcgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2 34 0 

atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 24 0 0 

tttaaggcgt ttccgttctt cttcgtcata acttaatgtt tttatttaaa ataccctctg 2460 

aaaagaaagg aaacgacagg tgctgaaagc gagctttttg gcctctgtcg tttcctttct 252 0 

ctgtttttgt ccgtggaatg aacaatggaa gtccgagctc atcgctaata acttcgtata 258 0 

gcatacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agcgggtgtt 2 64 0 

9gcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 27 00 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 2760 

tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2 82 0 

cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880 

tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2 940 

gaattcgagc tcggtacccg gggatcctct agagtcgacc tgcaggcatg caagcttctc 3 00 0 

ttgtgccggt tgtacgctgt caggtcacac tggtgagtta ggcagggcac agatgcccag 3060 
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agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3120 

gccctcagct agagaggggg cttttatttg agaccagaat cacctgagca tcctcctgtc 3180 

cccagctgtg tccagcctgt ctgcagggac atcctgagag gaccaggctc tcccctcatc 324 0 

cacctgccta agtgccactc tgaaccctgt ccacctgtgc cgtggagggg cgtgacctca 3 3 00 

agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360 

ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 342 0 

ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480 

ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 354 0 

gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600 

cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660 

ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720 

gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 37 80 

ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3 84 0 

tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3 90 0 

gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3 960 

ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 402 0 

cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080 

tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140 

catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200 

gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 4260 

ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 432 0 

aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 43 80 

cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 444 0 

cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4500 

cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4 560 

cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4 62 0 

ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4 680 

ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740 

gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 4800 

ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 4860 

ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 492 0 

acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 4980 

tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca 504 0 

tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 5100 

cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 5160 

gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 522 0 

aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 5280 

ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 5340 

caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 5400 

catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 5460 

ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 552 0 

cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 5580 

gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 5640 

gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttcctgggg 5700 

gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 5760 

ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cc tctgtgga 5 82 0 

gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 5880 

ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 594 0 

tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 6000 

cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 6060 

caggcagcgg gagcacagca ggcagagggc ggggctcgag cagccatccg tggaccctgg 6120 

ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 618 0 

cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 624 0 

ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300 
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aagtcactgt 
ctgactccaa 
cttggcagtc 
aggcagaggg 
ctcagaacct 
cagggaacaa 
tgctctgccc 
gctgccccag 
ctcttggccc 
catccctgtc 
cagtcctcca 
attgaccgac 
catctactga 
ctatttacca 
accaacattt 
caccgacgtt 
ctgacgctgg 
ctgacattta 
aacaccagca 
taccaacgcc 
catttaccga 
ctgatattta 
agcaccgaca 
gccggcttgg 
catggggtca 
aaccctaaaa 
tccctcgggt 
gacgccaaga 
aggcccagcc 
gctctgctat 
acgccttgag 
ccaggtgcct 
agtttgaggc 
actgtgagat 
actgccctgc 
gagtttggtg 
tctccctttg 
ccgtggacgc 
gcagcctcgc 
gccctgccgc 
gctgtggtcg 
ctgtggagcc 
tgtttgtttt 
cactgaaacc 
tgggattaca 
gtttcaccat 
ggcctcccaa 
attttgaggt 
ccattgaggt 
ttctgaagga 
tgctctcctc 
tggagaacac 
agtctcagca 
ccacgaaaac 



ctacatgggc 
ggcccctcct 
tgacctgtga 
gctggccggc 
tcacccctga 
gtcacctgtc 
cttcctccct 
accctgggca 
ctgagagtcc 
ccaggggagc 
gcacttactg 
accgacattt 
cactggcatc 
acactgacat 
accaacactg 
taccgacacc 
catctactga 
ctgacactga 
tctaccaaca 
gacgtttacc 
cactgacatt 
ccaacgccag 
tttactgaca 
gggcaggcct 
tggacttctg 
tcgagtgtgc 
ggcccaacgg 
cagacaagat 
accccctgca 
ttggccacat 
agggtagcgc 
gtagacacag 
cagcccgggc 
ttaaattatt 
ttaagacaga 
attggttcct 
ctgggtgcag 
agcctcgccc 
cctgctgctg 
tgtggtcggg 
ggtttcagtg 
tctcttgagc 
tgttgtcgtt 
tctgcctcct 
agtgcccgcc 
gttggtcagg 
agtgttggga 
tctcttggtg 
gacattggtg 
ttttcctgag 
agggctccca 
gagtgtgtgt 
ccttcagggt 
cctaggagct 



ctcggtgtcc 
agccctggtt 
ggtcgtgggg 
tgggtcagac 
gctgccaccc 
ctttgcatag 
ctaggtgctc 
gggtcatttc 
aagggctggt 
ccccagggac 
acaccagcat 
accgacactg 
taccaacact 
ctactgacat 
aaatttaccg 
gacatttacc 
caccgatgcc 
tatctactga 
ccgacattta 
gacgccagca 
tactgacact 
catctactga 
ccaatattta 
tgctcttggc 
cttcttctcc 
caacttggat 
cctggccctg 
cgaggtgagg 
gccagatgta 
ggaatgcttg 
tatacaacgt 
ctacttggag 
aaaggggaca 
tcacaataca 
aaacatgggt 
aagcagttga 
cctcgccctg 
tgccgctgtg 
tggtcgggtt 
tttcagtggc 
gcctcgtccc 
ttttttgttt 
gttgttgccc 
tgggttcatg 
accacgcctg 
ctggtctgga 
ttacaggcgt 
ccagtgatat 
cctctcctcg 
ctgcatcagt 
ggggaggccc 
ctgtgttgca 
ggggaaccca 
gcaggtgtgg 



tcatctgtga 
tcatgagtct 
tccagggagg 
acagctgaag 
caggatctgg 
gggagccctt 
agctcctcca 
tgtccactga 
gtggtgaaac 
tctggtcact 
ctactgacac 
acatttacca 
gacatttacc 
tggcatctac 
acaccgacat 
gacactgata 
agcatctacc 
cactggcatc 
ccaacaccag 
tctaccaaca 
gacatctact 
cactgatgtt 
ctgacatcaa 
actggggatg 
agcctcatgt 
gggcaggagc 
gacctgcagg 
ctcctgtgga 
cgtattggcg 
agaaaatagt 
cctgtggtta 
actgaggtgg 
caggaatcct 
aaattaagac 
gttgttgaag 
aggcagtttt 
ctgctgtggt 
gtcgggtttc 
tcagtggcct 
ctcgtcccgt 
atgggcgtgc 
gttgtttgtt 
aggctggagt 
ccattctcct 
gctaaatttt 
actcctggtc 
gagccaccgc 
gtacatgtgt 
gggtggatgc 
ccttgttgac 
gaatcaggca 
ggatttcaga 
gggatggggg 
ccatttcctg 



aaaaggaagg 
gaggatccca 
ggcaccgagc 
cagaggctgt 
gttccctcct 
cagctatgtg 
gcccactagt 
cctttgggat 
ccgcacaggg 
gggcttgccg 
caacatttac 
acactgttta 
gacactgaca 
caacaccaac 
ttaccgacac 
tttaccaaca 
aacaccgaca 
tactgacacc 
catttaccaa 
ctgacattta 
gatactggca 
taccaacacc 
catttagcca 
ctgcagagac 
actggacaga 
ggcgtgtgct 
aggggaagct 
catgtttgat 
aggcaccgat 
tacaatactt 
cgtaagatgt 
gaggatcgct 
ctgcactgct 
aaaaagttaa 
ccagaggcag 
gtttttccat 
cgggtttcag 
agtggcctcg 
cgtcccgtgg 
ggacgcagcc 
tttggcagct 
tttgtttgat 
gcagtggcgc 
gcctcagcct 
gtatttttag 
tcacatgatc 
gcccagcctc 
ccccatcgca 
ctccctctgt 
gtcaccatcg 
gcttgcaggg 
ccctgcttct 
aggctgagtg 
ctggagctcc 



gatggggaag 
gggacatggg 
tggaagcggg 
gacttggggc 
tggggggccc 
cagaaggttc 
cagatgtgag 
gggagatgag 
tggaagtggg 
ctggcatgct 
aaacaccgac 
ccaacactga 
tttaccaaca 
atttaccgac 
cgtttaccaa 
ctgacatcta 
tttaccaaca 
aacatttacc 
caccgatgtt 
ccgacaccga 
tctaccgaca 
gacatttacg 
tgtgatgggg 
cagacagact 
ctggggagag 
ggtcaatgcc 
ctactgggga 
ccaggaggcc 
gggtgcctgt 
tctgacaaaa 
tatcattcgg 
ggagtccaag 
tttgccactt 
tcacatatcc 
ctgctggcct 
agatgtctgt 
tggcctcgtc 
tcccgtggac 
acgcagcctc 
tcgccctgcc 
ttttgctcac 
tttgtttgat 
gatctcagct 
cccacatagc 
tagacagggg 
cacctgcctc 
tgttgagcat 
ccatcgtcac 
ttccagcaac 
gggtcacctt 
cagggcagga 
gagcgggagg 
gacgcccttc 
ttgtaaatgt 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
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tttgtttttg 

ccgctacccg 

ggcgttgttc 

cccaggcgag 

aggtgatcct 

cagctagttt 

gatctcaaac 

gacaggcatc 

caaaattgat 

gcttgtaatc 

gagaccagcc 

ggcatggtgg 

tgagcccggg 

tgacagagca 

aattcagggc 

tagaacactt 

cccctctctc 

ttctggacgt 

ccatagcatt 

cctgactggg 

cgatcacttg 

aaaaaatttt 

tgggaggctg 

atcgcaccac 

aaaaaaaaag 

ttttgtttgt 

aatgctgctg 

gatatatgtg 

ggaaccatta 

tgagggttct 

caggtgctct 

cctgaggtca 

aatataaaaa 

gaggcaggag 

ctacactcca 

taaataaaaa 

tgcctcgcat 

cagcactttg 

cagcagctgt 

cgaccaagct 

catttttcag 

cgatgtctgt 

acagggggtt 

ccatactgtc 

agaaatgcag 

cccaggaact 

tacccacttt 

gtggcacgat 

cagcctcccg 

ttttagtaga 

gatccgcccg 

cccgatttcc 

gttgctatag 

gtcaggaggt 



gcaaggccca 
tgggagcagg 
catgtactgt 
agtgcagtgg 
cctgcctcac 
ttgtgtattt 
tcctgaggca 
agccgtcaca 
gattttaacc 
ccagtacttt 
tggccaacat 
caggtgcctg 
aggtggaggt 
agactctgtc 
atttagtatg 
tcttctgccc 
ccagcctctg 
tttgtgtgac 
gtgttttcca 
tgcagtgggt 
aggcaggagt 
aaaagttaac 

aggtgggagg 

tgcactccag 
ttcctttctt 
ctgtttatcg 
tgaacatttg 
taggggtagg 
aactgttttc 
gactttacct 
ggctcacgcc 
ggagttcgag 
ttagccgggt 
aatcacttga 
gcctgggtga 
tttattaaaa 
ttccctgata 
agatacgtcg 
gtcatgtcac 
ccaatcctta 
atgagaacac 
ggtttggtcc 
ctggggaatg 
cttgggaagt 
tcaggaggaa 
acacttgctg 
tttttttttt 
ctcggctcac 
cgtagctggg 
gatgggattt 
tctcggcctc 
cactttaaga 
ggaatatgag 
agaagggcgg 



tgtttgcggg 
tcagggactc 
ctcagcagtg 
cgcaatcacg 
cctctgagta 
tttgtggggg 
caagcgatcc 
cgcagctcaa 
attttaaagt 
gagaggctga 
gatgaaatcc 
taatcccagc 
tgcagtgatc 
ttgaaaaata 
aggacaatgt 
tgaaggaaac 
tcaaccacta 
tggaatcctg 
gattcaccca 
taggcctgta 
ttgagaccag 
tgaacgtggt 
atcgcttaag 
cctggacaac 
tttgtggctg 

ggtggtgggc 

aattcaagtt 
attgctgagt 
aacagtggct 
cctcacaaac 
tgtaatccca 
acgagcctgg 
gtggcagcgg 
acccgggagg 
caagagtgaa 
cattcatcac 
actaggatgc 
cacagtcccc 
acagcgagca 
ccgcctccta 
cgaagctcaa 
tgaagtcact 
gggaaggtgt 
tagaagactc 
gcggagctgt 
gggccttcgt 
ttgaggtgga 
tgcaagctct 
actacaggcg 
cactacatta 
ccaaagtgct 
atctgtctgt 
gcttacaggc 
agagagggaa 



ccgctgagga 
gcgtgtctgt 
gttttcttga 
gctcgctgta 
gctgggacta 
gagatggggt 
acctgcctcg 
tgattttatt 
gaacagttca 
ggtgggcaga 
agtctctact 
tactcgggag 
tgagatcatg 
aataaataaa 
ggtgcaggta 
cccatgccca 
atctactttc 
caatatgtgg 
cacattgtcg 
atcctaacat 
cctggccagc 
ggtggtgggc 
cccaggaggt 
agagcaagac 
gatgacatcc 
agtggtttcc 
tttgtttgaa 
cctatggtaa 
gcgccgttct 
gcttcttttc 
gcactttggg 
ccaacatggt 
gcgcctgtaa 
cagaggttgc 
actccatcta 
agccagccta 
tgagcgtctt 
atttgcgaac 

gggggtctct 

gtgttgtgga 
aa caggagcg 
ttatatctca 
ctcaggtgaa 
agagggtctg 
ccaggaacag 
gtcacaatga 
gtctcgctct 
gcctcccggg 
cctgccaccg 
gccaggatgg 
gggattacag 
acatcctcaa 
catggtgctg 
caggcggagg 



tgatttgcct 

ggcacaccag 

gacagggtct 

gcctcaatct 

cagacacata 

ttcgctgtgg 

gcctcccaaa 

gtggtaaaat 

ggctgggcgt 

tcacctgagg 

aaaaatacaa 

gctgaggcag 

ccactgcact 

aaaaatttta 

tctctgctac 

ccggcactca 

tgtctctggg 

tccctgcgtg 

cacgttatca 

tctgggaggc 

ctagcaagac 

acttgtggtt 

caaggctgca 

cctgtctgaa 

cattgtatgg 

accttttgtc 

cacctgttgt 

tgttaggttt 

gcatccccac 

catttaaaaa 

aggccgtggc 

gtaaccccat 

tcccagctac 

agtgagccaa 

aaataaaaca 

gtgggtgt cc 

gtcccaggct 

gagaaatgag 

gagccgtctg 

tgtagcccag 

ttttgtccac 

gtggtccaga 

aggaaggaat 

gcaaagtcag 

gggggtcgca 

cgtgagcact 
cttgcccagt 
ttcatgccat 
cgcccggcta 
tctcgatctc 
gcgfcgagcca 
agccctatac 
gacacacaga 
tcacatcctt 



tcacgcatcc 
gcctgtgaca 
cgctcgctca 
ccctgggctc 
ccaccacacc 
tgcccaagct 
gtgctgggat 
aaacatagca 

ggtggcttat 

tcaggagttt 
aaattagccg 
gagaatcgct 
ccaatctgtg 
aaaagtgaac 
tatctacttc 
cgcccattct 
ggttcacttc 
tggcttcttt 
gaatctcatt 
caaggcggga 
cccagctacc 
cccagctacc 
gtgagctatg 
aaaaaaaaca 
ccacagcaca 
tcctgtgaat 
gaattatttg 
gacttactga 
cggcagtgtg 
aatattcagc 

gggcggatca 

ctctaccaaa 
ttgggaggct 
gatcgcgcca 
aaaataaaaa 
catgtggctt 
tgccacacct 
gtttagggaa 
accccacagc 

ggtgctccca 

attggataca 
ctggagtagg 
tccagattct 
acaaagcaag 
ggagctcacc 
gcgtgttgat 
ctggagtgca 
tctcctgcct 
atttttgtat 
ctgacctcat 
ccgcgcccgg 
acagtgctgg 
agggacggag 
ggctttcaaa 



9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 
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atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12 84 0 

ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900 

aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12 960 

gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 1302 0 

tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13 08 0 

agtgagccga gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 1314 0 

aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 132 0 0 

agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 132 60 

cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 1332 0 

gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 133 80 

cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440 

acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500 

cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560 

ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620 

cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680 

ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740 

gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13 800 

gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13 860 

^tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13 92 0 

gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980 

gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14 04 0 

gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100 

ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160 

acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 1422 0 

gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 142 80 

gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14340 

agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 144 00 

tatgaaatga ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg 144 60 

agcggagacg gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt 1452 0 

cctgggaatc agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc 145 80 

cccatcctgc gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag 14 64 0 

ccctgggcac cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt 147 00 

gcttggagcc gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg 14760 

aggtcccctg attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt 14820 

ttttccagcc tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc 14 8 80 

cttagccctc ttcctgagtc tagagccccg ggggttggaa gttctggccc ctgggacacc 1494 0 

tgcagccaca ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc 15000 

acgttcttgc ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct 15 060 

cccctctgcc ctgcccttct cccctctacc ctgcccttct ctcctctgcc ccgccatggc 15120 

ttttatatcc tgtgccacaa gacatggctg tgtgtgaaag tggcagggtc tggcatctct 15180 

gtgggtctct gaggcccacg ctccagtgcc actcttccca cccgctggcc gtgccctcat 15240 

gctggaggga cagcccagcc ctctcccgaa ccccagcccc atgtgcccag ctgcccccgg 153 00 

ccctctcccc tggaagccgg ggtcactcca gccgtatgcc atggtgggga catcctgctt 15360 

ccttggcctt ccagggaagg tcctctttcc aaatggcgac acctggtccc tgcctggagg 1542 0 

ctggaagctg tggcccttgt atgcccctcc agggtctgtg cgctcggttg gcccgagttc 154 80 

ccatcaccgt catcatcacc atcatcattg tcatttcgct tgtctgtgag ccggcctggt 15540 

ctcccagagc agagaccctc tgaggtccag cctgagttgg ggtctccgtg ctgacccctg 15600 

acggggactc aggacgtacc aggtctgggt caggagtgac ccccaaacct cgtgcccttt 15660 

gacaggcacc cctgactttt gctaagtggg tggaggtgac atcacttaca gcgggagtga 15720 

tgggacaggg tctgttggct gcactgtgct cccagggatc tggggagagg ctatatccct 157 80 

gggctttggc actgcagagc tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 15840 

tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct ttttttatta 15900 

catgaagcaa gataactgtt gctgtttcct tttgggtttt gtgttcaaca gagtggggta 15960 

cttcttccct cagacaacag aactctcccc tttaaacacg tgctgtcaga gggtgggtct 1602 0 
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tgggctcatg tctgtttgca cagccgagtc agaggaaaca cagggttctt cataaaaaca 160 8 0 

ctgcacagca ggcgactgtc cagagtcagc ctgcaggacg gcagcagccc tgcccctcag 16140 

agcacagcta gggtgggctg ctttgggatc tcccgtcatt ccctcccagc tggcagccgg 16200 

cggccggccc attccttggt gtgctggtca ggggggcgtg cgcctgctct gctcaccctg 16260 

ggaatgggac agaagctggc agctcggaga ggacagggct ggacccttgg gtggcctctg 16320 

gctggaccat ctcattgtcc tcagacacag cctctcgggt ctagtttcat ttcctgaaaa 16380 

acaagtgcac agaactagag caggagtcga gagctacggc ccccgggcca gatccagccc 1644 0 

tgccacctgt tttcacacca tgctcaagct gagtgggttt tacatttttt aattacttga 16500 

aaaaaaaaaa gccaaaggag gtttcatgac ccatgaaaat tatatggaat tcaaaaaaaa 16560 

aaaattatat ggaattcaaa tttcagtgtc cataaataat ttcttgagac agggtctcgc 1662 0 

tctgtcaccc aggctggagt gcagtgctat ggcatggctc gctgtaccct tgacctccca 16680 

ggctcaagcg atcctcctgt ctcagcctcc tgagtagctg ggactacggg tgtgtgccac 16740 

caagcccggc taattttttt ttaattttag taaagacagg gtctttctat gttgcccagg 16800 

cttttctgga actccatctt ggcctcccaa agtgctggga ttacaggctc gagccacgga 16860 

gcccagcctg tttttgtttt ttcactgata aagttttgcc gggtgtggta gtgtgtgcct 16920 

ctagcgattt gggaggctga ggtgggagga tcgcttaagc ccaggagttt gaggctgggc 16980 

tcaagtgatc aggaggtgaa ctatgatcat gtcattgcat tccagcctgg gtgacagagc 1704 0 

aagaacctat ctcttaaaaa tatatattta aaaagtattg ggtgtggtgg ctcacgcctg 17100 

tggtcccagc tacttaggca tctgaggtgg gaggatggct tgagcccagg agtttgaggt 17160 

tgcagcgagc caagatcgtg tcactacact ctagcctggg tgacagagcc cagaccctgc 17220 

ctctttaaaa aaaaaaacca aaaaacatgt attggaacac agccatgcct gttcagtcac 172 8 0 

gtgctctcca tgctgctttc tgctccagag acccttatgg cctgaaagct gaaaatattt 1734 0 

tctatccttt acaaaaaagt ttgctgacct ctgtcctgga aaattcatct cccaagttct 17400 

cttccggcac tggcgttcct gggtgtccta aatttggccc ctgttatttc tgaactctgt 17460 

tttggctctg ttccctccca ggagccagga caggcacgtt ctctgcatct tgtcccctga 1752 0 

cgcccagagg cttggctcgg ctcaggcatt cttggaaata tctggctcca ggaaaggcag 17580 

aggcctcctg agtcagccca gagggaacct gccccaggtc tgggggaggc ctgacccagc 17640 

agagtggctt ttgccgatgg gttgggccgg tcaagatgtg ctgaaagttg tcctcagaag 17700 

gccactttgg gattccttcc tccagtatta gagcaactga gagctgctca ttgcaagcct 17760 

gatgttttcc cagttggccg ggtccaccgg gtgccctggg attctgggat ctgggtggaa 1782 0 

agtagggggc ttgggggagt gtcctgggtt ctggaatcca ggtggcaagt ggtgaggttc 17880 

agggagtggc ttctgagcca ccataggggt ctctgtggga ggctctgccc atccaggaga 17940 

ttccgcaggc cctgccggcc cagagccagc gtcttgcgct tgccgaggct acagccagcc 180 00 

ccagccgggt ggaacagccc gtcgcctcct ctcactttgt tttggggcca cctgggagtg 18060 

tggagcaagg gtagagaggg aggaagtggc tgccggccgc tgcccagcac ccttgtttgc 18120 

cttgggccct ctgtgggctc ctttttattg ctcttcaatg aagccaggga aatggacttc 18180 

cttgcctcac ttcagttcaa catgtctgga agtttggtat taaaattaag aaagtgtgga 18240 

aatagagcaa gaagagaaaa atctctccaa gagataatag tgacctctga gctgggcgcg 183 00 

gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 183 60 

tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 18420 

aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 184 8 0 

agctaaggca ggagaatcgc ttgaacctgg gaggcaaagg ttgcagtgag ccaagatcac 1854 0 

gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 186 00 

taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18660 

ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 1872 0 

gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780 

ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840 

tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 18900 

gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960 

cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 1902 0 

gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080 

gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 19140 

accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 192 00 

ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 192 60 
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tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 1932 0 

ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380 

tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 1944 0 

catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500 

gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560 

ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620 

tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 1968 0 

attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 19740 

gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800 

ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860 

tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 1992 0 

tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980 

atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 2 004 0 

atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100 

tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160 

tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220 

ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280 

acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 20340 

gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400 

caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 20460 

aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520 

ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 2 0580 

tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 2064 0 

caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 2 0700 

tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 2 0760 

cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20B20 

gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 2 0 880 

cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940 

aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 210 00 

tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060 

actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 2112 0 

aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21180 

tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 2124 0 

gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 2130 0 

cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 213 60 

agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420 

acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 214 8 0 

ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 2154 0 

tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 2160 0 

catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660 

ctggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720 

caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 217 8 0 

tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840 

acagtgaccc cgtctgcaaa tgtagttcct tgccctactc gcactgggga gcaggacgca 2190 0 

gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960 

ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatctt 22 02 0 

caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 220 80 

gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 2214 0 

gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 222 00 

gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260 

ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320 

agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 223 80 

gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 2244 0 

cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500 
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ttagttttcc 
tcacatctga 
tcgagaccat 
ccgggtgcgg 
gcgtgaaccc 
ccggcctggg 
aaaaattagt 
ggagaattgc 
tccacctggg 
tgccaggtgt 
tcacttgagc 
taccagaaaa 

gggggctgac 

atgccactgc 
aaaaagcatt 
acaaaaaata 
ctgagaaatt 
aaaaaggatt 
actttggttg 
gtgaaaaaga 
tgtctgtgaa 
aacttgcctg 
ccggaggctg 
gctcctttca 
agctcctacc 
ctggcgcctg 
ccaggggaga 
gtttgatgtg 
caggtgaaag 
acctgacctg 
ttcattgtgt 
cctctgtgtt 
ctgtgcttct 
agtgacatga 
atccacagga 
aaggaggcct 
agcctgaagg 
ctctgttcct 
acgcagttca 
atgagcaagt 
tatttgttgt 
ttgctcagca 
gaagactttt 
cgtagcgtga 
tcagcctcct 
atatttttag 
tcaggtgatc 
tcccggcata 
tggccttctg 
taatcccatc 
cagcctcacc 
ggcatggcgg 
tgaacctggg 
caacaaaagt 



cagcctcctt 
aatcccacca 
cctggctaac 
tggcgggcgc 
gggaagcgga 
cgacagagcg 
ctgggtgtgg 
ttgaacccag 
caatagagcg 

ggtggctcat 

ccagaaattt 
aaaaaaatta 
gtggcaggat 
actccagcct 
tactatccac 
aaccccagag 
gggcttcttc 
ctttttttgg 
atgagtgaaa 
ggcagctttg 
tcggaggaag 
gtccctgccc 
gaccactggg 
gcagccttgc 
tccaaggtga 
acaaggtcca 
acttggttgc 
tggcgggaat 
gtgcgtgtgt 

ggggtgagtc 

ggcttggccg 
cgcttccagg 
tcacacccca 
agacctgcat 
tctccctcga 
cagccctgga 
tagcgtgggc 
gcctcaaagg 
agctaatcaa 
ccggtagaca 
ccgttgagct 
gactctttct 
tttttttttt 
gctcagctca 
gagtagctgg 
tagagacagg 
tgcccgcctc 
ggaaaacttt 
attctgtaaa 
actttgggag 
aacgtggaga 
cgcctgcctg 
aggcggaggt 
gaaactccgt 



taaaaacgcg 
ctttgggagg 
aaggtgaaac 
ctgtagtccc 
gcttgcagtg 
agactccgtc 
tatcacgcgc 
gaggtagagg 
agactctgtc 
gcctgaaatc 
gagagtgtct 
gcccggcatg 
cacctgagtc 
gggtgacaga 
catggaaggt 
ataagacaaa 
cccctcccct 
ctgaaatatt 
ttaacagact 
gccatgctgg 
ggcctagctg 
tgaggcgttc 
ctggcactct 
tgcactttag 
atggagtgag 
gtcagagccc 
cattgccagg 
aaagctgtgc 
gtttgagggt 
ctgagctcgg 
cacccctttc 
aaccaacccg 
cgcaacccgg 
cgtgcctgag 
gaccaataac 
ctttgatgtg 
cagaacgtgc 
cttcagacac 
atatgagcaa 
cacataaggg 
tgacttcaga 
tcatttatag 
tttgagacag 
ctgcaacctc 
gactacagac 
gtttcatcat 
ggcctcccaa 
ttgccttcta 
gaaagtttga 
gccgacgtgg 
aaccccgtct 
taatcgcagc 
tgtggtgagc 
ctcagaaaaa 



ttcatgctgg 
ccgaggcggg 
cccgtctcta 
agctactcgg 
agccgagatt 
tcaaaaaaaa 
ctataatctc 
ttgtagtgag 
tcaaaaagaa 
ccagaacttt 
tccctgggca 
gtggcatatc 
tggaggcaga 
cagagaccct 
gagactgacc 
agggtgcctc 
ctcaccccgt 
taacactaaa 
ggccaaaaat 
gccaatgtga 
ggactctcag 
actgctttct 
tgccgagctg 
tttccttgaa 
ttcggacagg 
gcactgctgt 
tgttctccca 
acattggagc 
ttagcctggc 
cacccctgag 
cctgctgggc 
tgtgcggaca 
tgtggctgcc 
gccttcttgg 
aacgacgtgg 
tccaacaacc 
acacaggcag 
ttttcttaaa 
gcctatttaa 
cttttgtgaa 
caccccaccc 
tgcaaatgta 
agtcttactc 
cgcctcccag 
atgcaccacc 
gttggccagg 
agtgctgaga 
aagaagagtt 
ttggtggctg 
gcatatcacc 
ctactaaaaa 
tactcaggag 
tgagatggca 
aaaaagtttg 



ggccggggca 
tggatcatga 
ctaaaaatac 
gaggctgagg 
gcgccactgc 
aaaaaaaagt 
actactcgag 
cccgtatcgt 
aaaaaaaaaa 
ggaagactga 
acatagagag 
cctgtggtcc 
ggttgaagtg 
gtctcaaaaa 
tgtgagtgat 
catgggggtg 
ggtttgctaa 
ttaaagccaa 
aaacgaacgg 
gttttcaggg 
gagccaaggc 
tcctgggcca 
ctccctgact 
tgaaaaatgg 
tgactccctg 
tactgatacc 
ccacccccac 
ttttggcaca 
caacccagcc 
ctgtgtggct 
tgttgatgtt 
ggaacggggg 
ccatcggcct 
tcttcaccag 
ccatcccgct 
acatctactg 
cctttatggg 
gcactatcgt 
aaaaaaaaaa 
atgcttgtgt 
actcccttgt 
aacatccagg 
tgttgcccag 
gttcaagcga 
acacccagct 
ctggtcttga 
taacaggtgt 
tagcaaacta 
ggtgcggtgg 
tgatgtcggg 
tacaaaaaaa 
gctgaagcag 
ccattgcact 
attggtgtaa 



gtgcagtggc 
ggtcaggaga 
aaaaaattag 
caggagaatg 
agtccgcagt 
aaaaaaaaaa 
aggctgaggc 
accactgccc 
agaacattta 
ggcaggagga 
acctcatctc 
cagctactta 
agctgagatc 
aaaaaaaaaa 
tgttcaaaga 
tgatttaaag 
aggagatggg 
ttttaacagc 
tctgtactat 
ttgctgggaa 
cc tgaggggc 
gatcacaggc 
tcctgaccat 
ggatgagaat 
ggaccagtgc 
cttggctgta 
tactgtccct 
tcctggcttt 
atgaggtcgg 
cacggcagca 
tagactggag 
gtgcagccac 
ggagctgctg 
cagagccgcc 
cacgggcgtc 
gacagacgtc 
aaaaccttgc 
atttattgta 
gatgattata 
gaatgtgaaa 
cggtgcccgt 
acaaatacag 
gctggagtac 
ttcttctgcc 
aatttttttt 
actcctgacc 
gagccaccgt 
gtctgtgggc 
ctcacacctg 
acttcgagac 
aaattaaccg 
gagaattgct 
ccagcctggg 
ccaaagcgca 



22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
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tttgtttatg 
tatgggctct 
ctctgttcta 
ttctgctttt 
gagtgcagtg 
ctcatctcag 
tttttttttt 
caaactccta 
gccaccatgc 
ctttaagtca 
agtaaaaatt 
tgcctttaag 
agaattctgt 
caaactcttt 
ttaggcttaa 
gcctgtaatc 
gagaccggcc 
gggcgtagtg 
ttgagcccca 
agcgacagag 
tatatataca 
atataatata 
gtaaatatat 
tataaaaata 
atatgtaatg 
tattatttta 
gctgcatttc 
cagtagctgc 
gcctggcgtg 
caactgaggt 
agaatacaaa 
ggctaaggct 
accattgcac 
atgctggttc 
ccttcatgaa 
gcatggccgt 
tcgaagtggc 
acccgaggtc 
cttgtgttca 
ccgtgtggcc 
ctggtagggc 
ggcacagtgc 

agggtgggta 

ttccctaagg 
cctgaccagg 
ggtccccgga 
ggacccacct 
gggcacgggg 
tttctgtgtg 
aatccctgcc 
acagaaagag 
ggatgtgcat 
aggagcccag 
gttttgtgat 



gattgtctgt 
aaagcctaaa 
aagtattgga 
tgttgttgtt 
gtgtaatctc 
cctccctagt 
tttttttttt 
ggctcaaaca 
ccagcctatt 
caagggttct 
gtccataacc 
tcttttttct 
atcctgattt 
ggcacaatct 
ctgttctttt 
acaacacttt 
tgggcaacat 
gtgcatagct 
ggaggttgat 
caagacttgt 
tatatacata 
taatataaaa 
aatatataat 
tatatttata 
tatatttttt 
tataaaatat 
aaacattcat 
tgccagtggc 
gtggctcacg 
ctggatttcg 
aaaattagcc 
ggagaatcgc 
tccaggctgg 
ctaaaatgtg 

c ggg a gctcg 

tgactggatg 
gcggctggac 
gctggccctg 
cctcgtatga 
ctcagtgatt 
cctcagaggt 
tggctgcttt 
tctgtgtcga 
tcacataccc 
tctccttgct 
cctcaggctc 
cgagaggcaa 
ttcagacagg 
gtggacacag 
tgtcgcctgt 
gtccaggttt 
ttcctcttct 
agccgagttt 
gtgcttgtgt 



ggcagctttt 
acatgtgcca 
caactacaat 
gttgttgttg 
agctcactgc 
agctgggact 
ttgtggagac 
cccacctcag 
ctactgtttg 
ttgtagaaaa 
tctccagcca 
ctaagatagc 
tttcacttga 
tttaaatgta 
attttgcgtg 
gagagggtga 
agtgagaccc 
gtagtcccag 
gctgcagtga 
ctggggaaaa 
cacgcacaca 
atatatattt 
ataaaaatat 
aataatatat 
aatgtatgat 
atataaaatc 
ttaaaaactt 
caagcatctc 
cctgfcaatct 
agaccagcct 
aggcgtggtg 
ttgaacccag 
gcaagaagag 
gcccttttcc 
gtggagcacg 
ggcaagaacc 
gggcagttcc 
gatcccacca 
gacagtgcgg 
agagctgtac 
catggagttc 
cacctgggcc 
ttttacagat 
agttagagca 
ttgcactctt 
ctccgaaatg 
atgtgctttg 
ccccagatgt 
tgctcacccc 
tctgctaggg 
tagagcaagg 
ggatctgcta 
acatccggat 
ccatcctggt 



gttctgccga 

tccgcccctt 

gtttgctcat 

agatagggtt 

agcctcgacc 

acaggcacac 

agggtttccg 

cctcccaaag 

tattacatag 

aaatatatat 

gagacgaccg 

atttctcttc 

cattacaaca 

tgaaatactc 

tgctggttac 

ggcaggagga 

catctctaca 

ccaccaagga 

cctgagatta 

aaaaaaaaaa 

cacataatat 

ataaataaaa 

atattatata- 

aatacatact 

atataatata 

tccaagttgc 

gaatgctggt 

gggagatgtc 

cagcactttg 

tgccagcttg 

gcatgtgcct 

ggggcagagg 

cgaaactccg 

tcctcacctg 

tggtggagtt 

tctactgggc 

ggcaagtcct 

aggggtaagt 

gggtgccaac 
tgatgtcatt 
cttcgtggag 
gtctcaccga 
gcagaaacag 
gagctgggcc 
gccaaaacca 
gcctcttgga 
agctgccagg 
gtggggcgtc 
ttaaagcacc 
aaggaaggaa 
gcaggtcaaa 
aaagaagagg 
cccgcaaggc 
ttctgccgtg 



gatgagttgt 
tacagaaaaa 
ttattattct 
tccctctgtc 
tcctgggctc 
accaccactc 
catgttgccc 
tgctgggatt 
ctttaaaaga 
ataggaaagt 
ttgctgacac 
atcacagtca 
ggtatttgat 
cactgcacag 
agccgggcac 
tcacttgagc 
aaaaactttt 
ggctgagttg 
ctccactgta 
aatatatata 
aaaaatatat 
tttataaatt 
atatataata 
tataagtata 
catttataaa 
tttttccaaa 
gatctggtcc 
tacaaaacac 
ggaggctgag 
gtgaaacccc 
gtaatcccac 
ttgcagtgag 
tctcaaaaaa 
ctgccagacc 
tggccttgac 
cgacactggg 
cgtgtggagg 
gtttgcctgt 
tgggcaaggt 
agccttgatg 
cgggtgctga 
agtgtccatg 
gctcagagaa 
aggaagtgct 
tgatccagaa 
ggctgctgag 
cgtcctgggg 
tttctggact 
tgtgatgtgt 
gacttcagga 
cttagaaaat 
gaaggagggg 
ctcccctgcc 
tccccaacat 



gacagatctg 
gtgtgctgac 
atgatttgtt 
actcaggctg 
tagtgatcct 
ctggctgatt 
aggctggttt 
acaggcgtga 
ttttttatga 
ataaaaagaa 
ctcagcatat 
tatgctacgc 
ggcgctgtga 
atgtttgctt 
agtggctcat 
ccagaagttt 
ttaataagtc 
ggaggattgc 
ctccaacctg 
tatatatata 
atttataaat 
atatttataa 
aaatatataa 
tatttaaaat 
tacacattta 
aaggtgtctt 
agaatgtgtt 
gctggttctg 
gcaggtggat 
atctctacta 
ctacttggga 
ccgagatcgc 
aaaaaaaaag 
atcagccgcg 
taccccgagg 
accaacagaa 
gacttggaca 
cccgtgcgtc 
ggcaggctgt 
gtggccagga 
ggctgliatca 
gagcctgcgt 
accgagtgac 
gtctcaggct 
ctgactttga 
ccacagctta 
gccctgcctt 
tgagttttct 
gcagcagccc 
tggcaggaca 
tctggaatga 
ctgctggggg 
ctgaggtctt 
ccggccaagc 



25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
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ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 2 9040 

tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 2 9100 

ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgctfccct 29160 

gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 2 9220 

ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 2 9280 

gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 2 9340 

catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 2 9400 

cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 29460 

gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 2 952 0 

gggcgttggc cacctcccag cctcgccgca cgtaccctgfc ggcctgcaag ttccccaacc 2 958 0 

tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 2 9640 

tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700 

gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 2 9760 

atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 

acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 2988 0 

ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 2 9940 

acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 

tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 3 0 060 

ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 3 012 0 

gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 3 018 0 

agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 3 024 0 

tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 3 0300 

cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 3 03 60 

tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 3 042 0 

cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480 

agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 3 054 0 

gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 3 0 600 

ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 3 0660 

gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 3 072 0 

cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 

ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 3 0840 

accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900 

aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 3 0960 

catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcaccgca cagctgtggt 3102 0 

tgcctgaaat gattaagctc attaatcacc ccggagtgag gacagactca gatgaaaacc 31080 

agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 3114 0 

ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 312 0 0 

ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 312 60 

gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 3132 0 

gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380 

cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 3144 0 

gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 31500 

gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 3156 0 

ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 3162 0 

cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 3168 0 

tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 3174 0 

atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800 

ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 31860 

gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 3192 0 

tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 31980 

ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 32040 

tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 3210 0 

tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160 

agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 32220 



-85- 



BNSDOCID: <WO 02092000A2J_> 



WO 02/092000 PCT/US02/14877 



gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280 

ccaaggccat acaactgggg tgggttgagc cggggttgac tgtggcaggc tggctcaaga 32340 

gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 324 00 

acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 32460 

tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32 52 0 

agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580 

aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32 640 

gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32 7 00 

ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760 

tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32 820 

acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32 880 

ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32 94 0 

ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct gttgagaggg 33000 

agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 3 3 060 

gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 3 312 0 

tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 3 3180 

cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240 

tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300 

taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360 

aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420 

gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 33480 

cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540 

gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33 600 

aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 3 3 660 

tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat agcgcacccc 33720 

caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 337 80 

ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 3 3 840 

ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900 

aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct catcctgccc 3 3 960 

ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020 

tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34 0 80 

ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140 

tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 342 00 

gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg agactgtgat 342 60 

tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 3432 0 

ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 343 8 0 

gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440 

gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500 

agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560 

gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 3462 0 

atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34680 

ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740 

ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 348 00 

tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34860 

gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920 

tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34980 

ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040 

tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100 

tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160 

cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220 

ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 352 80 

gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340 

tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400 

aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 3 5460 
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acaagagcaa 
tgagcacttt 
cctcaggccc 
aggacaggcc 
ctagcacttt 
gccaacatgg 
cgcgcctgta 
gaggttgcag 
ccatctcaaa 
gtgggagctt 
gcgcctggag 
cccatctgaa 
tcccgcagca 
agtcagccga 
ctgtgatcat 
gctacaggtg 
ggctcccatg 
ttcttttatg 
ttccacttaa 
cctgcagcct 
cccccaaaag 
tgtttttatt 
tgttattgtt 
cgatcatggc 
ctctagtagc 
ttttattgta 
cctcccacct 
ttttaatttt 
agtggcgtgg 
tcagcctcct 
gtatttttta 
ctggtgatcc 
cccggccttt 

ggtgggctct 

ctggctaact 
atggactcaa 
caccgcaccc 
tcacgaggta 
cgaaactgtc 
tatctctctc 
tgtgtgactg 
gtgtcagaat 
ggctcacacc 
aggagtttga 
ttagctgggt 
aatcgcttgt 
gcctgggtaa 
acggaccact 
tcgtgaataa 
aaaggcgtgt 
atttgtacta 
ccagcacttt 
gccaacatgg 
cgggcacctg 



aactccatct 

gcggagtcag 

acctggagtc 

tggctcctaa 

gggaggccga 

tgaaacccca 

gtcccagcta 

ggatctgaga 

aaaaaaaaac 

gtgttattgt 

agactttcac 

ggctggtgcc 

gtttctagtt 

attgaaattc 

tgcctctgct 

actgtcacgt 

ctgagaaagc 

gtgtctgcta 

tttggaaaat 

ccccgctctg 

atggtgtcta 

ttttattgtg 

gtttttgaga 

tcactgcagc 

cgggactaca 

gagacccgct 

caccctccca 

tttttttttt 

tatcaactca 

cccgagtagc 

gcagagacgg 

gcccgcctcg 

ttttgttttt 

tggctcactg 

gtattttatg 

gcagtccacc 

agcccatttt 

accatcactg 

cgcattgaac 

tgccttctct 

gcttctttca 

ttccttcctt 

ctctaatccc 

gaccagcctg 

gtggtggcgc 

acccgggagg 

cagagtgaga 

tcttgttatt 

ggccactatg 

ttgtggtgtt 

cattaaaatg 

gggaggccag 

tgaaaccccg 

tagtcccagc 



caaaaaataa 

gcttgtgtgg 

tgcagcatgg 

gcagccttct 

agtgggtgga 

tctctactaa 

ctcgggaggc 

ccatgtcatt 

ctacagagcc 

ggtgaaatct 

atgggtcgcc 

gtccagacag 

acgttgtact 

agagccatgc 

gtggtgtggg 

ggagtccttt 

tcctgggcgt 

ggcagaagtg 

gtgcttttcc 

tggactctgg 

aattatcatc 

gtaaatttat 

cagggtctca 

ctcagcctcc 

ggcatacact 

atgttgccca 

aagtgctggg 

tttgagatgg 

ctgcaacctc 

tgggattaca 

ggtttcacca 

gcctcccaaa 

gagacagggt 

cagcctccgc 

tagagacaga 

cacctcagcc 

acctattctg 

ccattcatct 

acactcctca 

aggtacctca 

tgtgacatgg 

aaagcctgaa 

agcattttgg 

gccaacatag 

gcacctgtaa 

cagaggttgc 

cttcctgtct 

tatccatcca 

aacatttcct 

gcataggatt 

aaagctcagg 

ggcgggcgga 

tctctactaa 

tacttgggag 



aaaatagaaa 

cctgttccac 

ggcacaacag 

tttacaaaaa 

tcacgaggtc 

atatacgaaa 

tgaggcagaa 

gcactccagc 

acacggcctc 

tggtactttc 

atgtccgccg 

gctggacgcc 

gtgaggtctg 

ctggctccct 

tggtggaaat 

ggagagaggg 

acattgcctt 

atgaagactt 

cctcctgtgt 

ctttgattct 

caattggccg 

ataacataaa 

ccccagtgcc 

agggctccag 

accacatctg 

ggctggtctc 

attacaggca 

agtctcactc 

tgcctcccag 

ggtgcccatc 

tgttggccag 

atgctgagat 

cttgccctgt 

ctcccaggct 

tttcaccatg 

tcccaaagtg 

cagttgacag 

ccagactact 

tctccctctg 

tgtaagtgga 

tgtcctcaag 

taataacccg 

gagtccgagg 

tgaaaccctg 

tcccagttac 

agtgaaccaa 

caaaaaaaaa 

cgggtgctag 

tccgtggtga 

ctggtaagaa 

ggccgggcgc 

tcatgaggtc 

aaataccaaa 

gctgaggcag 



aacaagtgct 

aaatgatgtg 

gttcattagt 

ctgcagagcc 

aggagttcaa 

attagctggg 

ttgcttgaac 

ctgggcaaca 

tttctccacc 

ttgaggcaga 

tcggtttcgc 

cctttccacc 

tgtccttggt 

ggagcttctc 

ggattccttt 

acgtgttaat 

agagtttcat 

ggaagaaaac 

cttttgctaa 

ttattaggag 

aggttttgtt 

atttgccatt 

caggctggag 

tgatcctctc 

gctgattttt 

aactcctgga 

tgagccacaa 

tatcgcccag 

gttcaagcga 

actatgcctg 

gctggtcttg 

tacaggtgtg 

cacccagact 

caagttgtgc 

ttgcccaggc 

ctgagattac 

ttcagtggca 

tcaccttctc 

acagccacca 

attataccaa 

gttcatctgt 

ttgtaaaggc 

tgggcagatc 

gctctactaa 

tcaggaggct 

gattgtgcct 

aaaaatcatc 

gtttcttcca 

aggttttgta 

agtttgcact 

ggtggctcac 

aggagatcaa 

aaactagcca 

gagaatggcg 



gtagcggaag 

ctcacggtgg 

gtagaattcc 

cgcctgtatc 

gaccagcctg 

tgtggtggca 

ctgggaggtg 

gagcgagacg 

gagtgttggt 

gagaggctga 

tgttgtgctc 

agatccttcc 

tgatggcaaa 

tcctgggcag 

catcttgctt 

tgatggatgt 

tggagctgcg 

ccagaaggtt 

ggtccagcct 

tccccctgct 

ttctattaat 

ttaattgttt 

tgcagtggtg 

acctcagcct 

tgtatttttt 

ctcaagccat 

cacccagcca 

gctggagtgc 

ctctcctgcc 

gctaattttt 

aactcctaac 

agccaccgtg 

ggagtgcaat 

acctccacac 

tgggcttgaa 

aggcgcgagc 

ttcagtcagt 

ggcagatgtc 

ttctactttg 

tatttgccct 

gttatagcct 

tgggcgcggt 

acttgaggtc 

aagtacaaaa 

gaggcaggag 

ctgcagtcca 

ggatggatgg 

cctttggttg 

ctagtgagga 

aaccataagt 

gcctgtaatc 

gaccatcctg 

ggtgtggtgg 

tgaacccggg 



35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 

36000 

36060 

36120 

36180 

36240 

36300 

36360 

36420 

36480 

36540 

36600 

36660 

36720 

36780 

36840 

36900 

36960 

37020 

37080 

37140 

37200 

37260 

37320 

37380 

37440 

37500 

37560 

37620 

37680 

37740 

37800 

37860 

37920 

37980 

38040 

38100 

38160 

38220 

38280 

38340 

38400 

38460 

38520 

38580 

38640 

38700 
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aggcggagct 
agactccgtc 
ttcagggttt 
aggagtgtgg 
actgctgtcc 
cctgattttg 
tccagggtcc 
gccccctagg 
tccttcgtgt 
tgtgtgtgtg 
ggggagggag 
gcgcgtgtgt 
gtccaagaga 
tttgtatttt 
gcatagctca 
gaatagctgg 
ggtttaattt 
cctgaactca 
ccactgctgt 
gacttactat 
agcttttccfc 
gccaaatcac 
ccccgcaggc 
ttcgaaggga 
gtgggctggg 
gacggagagt 
ctgagtcatt 
tacccccaca 
tgtaccttca 
taacccccac 
taaggggcgg 
ccgcagtcca 
cccatggtct 
tgatgcacag 
gctcttccag 
agctgattgg 
tgagaagtgt 
gccagtgccc 
ccagtgccca 
ttgttttgac 
tcgacatcta 
acaggctgag 
ccatcgtcgt 
tccaggcgtg 
tccacctggg 
cagcccgtcc 
tgagaggtta 
cttgttaaac 
tcgctcctgt 
cccacctgtt 
ccagagagaa 
ccaaactgag 
ggggctggca 
ttgagccctg 



tgcggtgagc 
tcacgcaaaa 
atgaaactgg 
ccagggtgtc 
caaagtctgt 
gccagggtag 
acagctagtg 
gtccacagct 
ccagatgaaa 
tgtgtgtgtg 
gcaccctttc 
gtggtgtgtg 
ctgggcctgg 
tttgagacag 
ctgcagcctc 
gactgtaggc 
tttttgatac 
agtgattctc 
ctccctgttt 
cagagcacca 
tcaggcctgg 
tcgggaactt 
agtaggcttc 
tgctgtgtcg 
cttaggcagc 
ccttgagagt 
ccccacactc 
tctgtaccga 
cctcccccac 
cctcacctgc 
cccccaaact 
aaagcttgag 
tggatccctt 
ctgggtgccc 
aggctgttgt 
agaggagaga 
ggcctctgct 

gggggtctcc 

ggggtctccg 
ctctctgagc 
cagccggaca 
cggggaagcc 
caacgcggag 
cccgccgtgt 
gatccaatgg 
gggaaagcag 
caaggcagcg 
ttgaaccctg 
tccaccacag 
ggtcgtcaga 
aagccaggga 
cacgggaacg 
tgaagacggg 
actctgtcac 



cgagatcgct 
ctctgtctca 
ccagccgcgt 
acgtcacaca 
ttttgcatct 
ctggaatctt 
gcttttagga 
ctgtagtgat 
gtgatgatgt 
tgtgtgtgtg 
catctggggg 
gtgtgtgtgt 
ggaagacgct 
ggtctcactc 
ggcctcccag 
gtgcgtcacc 
agatggagtc 
ccacctcagt 
tattaactgc 
tcctaagatg 
gtggcaggag 
ggatgcagct 
tgggctgtcc 
gaggccccaa 
gctcgagcat 
cactgagaga 
acctgcccct 
tccctttacc 
tcacccgccc 
cctcccctca 
gtctgtcctg 
tcctggaagg 
ctcgactgtc 
ggcgggtggc 
cctaatcgct 
cccccacacc 
gtcctgccag 
accagtgccc 
ccagtgctca 
caaggccaaa 
ctgttctgga 
atgggggtgg 
cgagggtagg 
cttatgccga 
gtggctttcc 
aaagccccaa 
ctggccgacg 
tgcagaaatc 
ggagctcctg 
gcttctggtt 
aagagggaga 
tttggggaca 
agctacagcc 
ttactggcca 



tcactgcact 
cgcaagactc 
aaagtttgct 
gtacacgttt 
ggttcccaca 
gcttgtcttt 
aggaatttgt 
gtggatgttg 
ctttgcagct 
tggtgtgtgt 
tgtgtgtgtg 
gtgtatgggg 
tctttttatc 
tgtcacccag 
gctgaagcga 
atactgagct 
ttgctatgtt 
ttcccgacat 
tgaaagacct 
attccctctg 
cccaggtgct 
gtctttcagg 
tctcctccta 
aagcccaggc 
ctcctggacg 
cgtggctgcc 
acccaccctc 
cgcaccttcc 
ctgcaccctc 
cctggcctcc 
gtgccctgca 
tggaggagac 
aatggggcct 
tgaggaggcc 
ctggcatact 
aatactggga 
agctctccag 

gggggtctcc 

ggagtcttgg 
acccagacag 
cgtgcgaggc 
tgctgcgtgg 
aggccaacgg 
atgccagcct 
agggtcccaa 
ggggctggaa 
ggagttgcag 
ccttccacgg 
ggcttcttcc 
ggtgggaagg 
ccgaaaccca 
ggagcgcgat 
agcacaggtc 
cgtgaccttg 



cgagcctggg 
cgtctcaaaa 
gtgttgtttt 
ctcagatggt 
gacccaccct 
cagcccggca 
tcagttggct 
ttatctacaa 
gcccagcaag 
gtggtgtgtg 
tggggtgtgt 
gaggcaccct 
tacttagaga 
gctggggtat 
tcctcccacc 
attgtttttt 
gcccagacta 
tctgggatca 
agataaagaa 
actcaatgga 
ccaggcccca 
gtaacccaaa 
cgtcagctca 
tcatccctga 
gtgaccccag 
ctgccttccc 
acctggcccc 
ctacccaccc 
acctgtcccc 
ttccgttggg 
gagaaaacag 
agggatgtgt 
tcatgggagc 
taaagtccga 
caggcgggca 
tttgactttc 
ccagtgccca 
gccagtgcca 
tttctttgtc 
gcagccccac 
caccaatacc 
ggaccgcgac 
gtgggtgggg 
ctcacaggct 
aagcaaacac 
ggggaaaggg 
ttgataggtt 
catgggggct 
tcccagaggc 
cacccaggac 
tgtgacatga 
ggccttcctc 
ctgggccggg 
ggcgggtggc 



caacagagca 
aaaaaaagag 
tgtgcccggg 
ggttctccag 
ccacggtgag 
gctgtaccag 
ttgacacatg 
agacacatga 
gctgtgtgtg 
tgtgtgtatg 
gtgtgtgtgt 
ttccatctgg 
ctttgtttta 
ggtgatatga 
tcagccttct 
ttgtttggtt 
gtctcaaact 
caggtgtgag 
agtctgaaaa 
gagggagggg 
tttgccccag 
ggaaccagat 
gtaagagccc 
gatgcacagg 
agagtgtgga 
aagaggggct 
cagcctcacc 
tcacctcccc 
caccttcacc 
gaaggggttg 
tacgtgaggg 
tgggaagggc 
gccagtctag 
ggcggcaaga 
cgtagttagg 
aggctaaact 
gggctctcca 
ggggtctccg 
ttacagccct 
gacctcagca 
atcaacgtcc 
aagcccaggg 
gtgctgcccg 

ggggagactt 

aggtttttca 
ggagctctgc 
ttgtatcatc 
gcctgttgac 
ccccgacgct 
cttgaggtct 
aactcaggct 
agatagctgg 
agcccagaga 
atagcctctt 



38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
39360 
39420 
39480 
39540 
39600 
39660 
39720 
39780 
39840 
39900 
39960 
40020 
40080 
40140 
40200 
40260 
40320 
40380 
40440 
40500 
40560 
40620 
40680 
40740 
40800 
40860 
40920 
40980 
41040 
41100 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
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ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 42000 

acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 42060 

agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 4212 0 

gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 4218 0 

tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 42240 

ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 423 00 

gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 42360 

ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 42420 

ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 42480 

tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgfc cagcctcggg 42540 

cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 42600 

cgaggagacg ccatccccac acccgtcctt cacccgccac cctcccgcag gtacctgtac 42660 

ttcaccaaca tgcaggaccg ggcagccaag atcgaacgcg cagccctgga cggcaccgag 42720 

cgcgaggtcc tcttcaccac cggcctcatc cgccctgtgg ccctggtggt agacaacaca 42780 

ctgggcaagc tgttctgggt ggacgcggac ctgaagcgca ttgagagctg tgacctgtca 42 84 0 

ggtacgcgcc ccggggcctg ccctaaccgc agacacccgg ccttcattgt cagtaatggc 42 900 

agcagctgcc acattgtccg agacctgccg tgagcccagt gccgcgccag gggctttgtg 42960 

tgtagcgtgt tttgtcctca cactgacagc tgtaggctgg ggttctgagt gagccccaca 43020 

gggcagaggc agaaaatgag tctcagagag ggtgagcgag ctgcttgggg ccccacagca 4 3 080 

ggagatggag caggactgca gcctagcctc tgcccccagc acctgcgcaa gaagctgctc 43140 

tgctctggac tgtgttaggc tgcgagggct ggagagaaat gagagttggt gcttagagag 432 00 

ggggcgcagg tccccatggc ttttcctctt atgatgaggt agatgggtga agggaggggc 43260 

catgcttgca ggggccagtg accgaggccc gccgttggaa ctgatggcct tcatcccgag 43320 

cccagcccag gtgggagcag ggctttccga gggcttgtct tgggtcggcc tgcttccagg 43380 

gactctgctg cagctcccac ccctgtccaa agcatggaat cccccaggct ccctggcagt 43440 

cctgtcaacc tctgtcctcc caagctgagt gtggggcaag ttctggaggt cagcactgct 43500 

caggggggcc cacgggctgc ttgcaggggc caaccgcctg accctggagg acgccaacat 43560 

cgtgcagcct ctgggcctga ccatccttgg caagcatctc tactggatcg accgccagca 43620 

gcagatgatc gagcgtgtgg agaagaccac cggggacaag cggactcgca tccagggccg 43680 

tgtcgcccac ctcactggca tccatgcagt ggaggaagtc agcctggagg agttctgtac 4374 0 

gtgggggctg gcagtggggt gggcagggtg gcctctaaac ccgacccctg gaggaggctg 43800 

gaggccagtg caagatcctg tgtggcctca gccaggcggt ggtctctgcc agatgccaac 43 860 

tgttgcccgc tggggttcag cgacatgtcc gaatgtcccg aggcctctga ggttgttttc 43920 

ttttgccgca gaacaaatca ccacgaacag cgttttaaga caacaccaac tctttttttt 43980 

tttttttttt tgagtcagga tcttgctctg ttgcccaggc tggggtgccc tggtgcaaac 4404 0 

acagttcact gcagcctcga cctctgggct taattaagtg aacaccttgc ctcagcctcc 44100 

caggtagctg ggactacagg tgggcaccac cacacctggc taattttttt ttgtagagac 44160 

ggggtttccc catgttgccc aggctggtct gcaactcctg ggcacaagct atctgcctgc 44220 

tgtggccfccc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 44280 

attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 44340 

ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 4440 0 

ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 44460 

gttttctctc tggttattgg tcggggacca ctctcagctc ctagaggcca ccacaggtcc 4452 0 

ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 44580 

cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 44640 

taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 44700 

gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 44760 

acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 44820 

ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 44880 

ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940 

ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000 

ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060 

tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 4512 0 

ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45180 
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attacgccac 
gggccagggg 
gagactgagt 
gtcatttctt 
ttgtccgggt 
tggtccattt 
tttatttatt 
gatctcggtt 
ccaagtaact 
gtagagacgg 
ccacgcacct 
catttgctat 
caaactggat 
ctgcgaccaa 

gttggccctt 

ctctgcccat 
ctgtgttgcc 
gatgacctcg 
tcacagattc 
tatacaatga 
tttaggaagc 
atgatgagac 
tggtggctca 
caggagattg 
aaattatggc 
gggtgaatta 
agatcacaag 
taaaaataca 
aggctgaggc 
ctccatgcca 
aaaaaaaaaa 
acttgggagg 
gagatcatgc 
agccaggcat 
ttgttcgacc 
ctgggtgaca 
agttcagact 
caagcaactt 
aagactctgt 
gcaggagggc 
aggtgggaga 
tgtgcccgtg 
cggtgctcat 
tgtgacctag 
gctgcctggc 
tgtcttcctt 
taagtactgc 
aatattttta 
cttagtttct 
attctactgt 
gtggtgtttt 
gtttttgaga 
tcaccgcaac 
tgggattaca 



tgcactccag 
tggtagtgac 
aaggacactg 
ccacctgggt 
gttggacaca 
gctattttat 
tattgagatg 
cactgcaacc 
gggattacag 
gttttcgcca 
cggcctccca 
tttaattccc 
ggcttaaagc 
actgttggga 
ccagtgctgt 
cttcatgtgt 
tcttataagg 
tcttgagatc 
taggccttaa 
aagacgggaa 
tgaagcggga 
cccatttcta 
cacctgtaat 
agaccatcct 
cgggcgcagt 
caaggtcaag 
gtcaagagat 
aaaaattagc 
aggagaatgg 
ctgcactcca 
aaaaaaagaa 
ctaaggcagg 
cactgcgctc 

ggtggtgcat 

cacggagatc 
gagtgtgacc 
gatggtatgg 
ctgtaattgt 
aggctgccaa 
cagttctcat 
cagagcctga 
acaatggtgg 
gcccagtcca 
gtgctccttt 
atccttttaa 
tgccctcctt 
gtttcctgca 
cacttctcct 
aaagagttgg 
agtctgagag 
tgttgttatt 
cagtgtcttg 
ctctgcctcc 
ggtgcacgcc 



cctaggcaat 
aaagagaccc 
tgaaattttt 
gtggtcagct 
tttggttaac 
tttattttat 
gagtttcgtt 
tctgcctccc 
gcacccacca 
ttttgcccag 
aagtgctggg 
atgtgtatta 
aacagaaatg 
gggctgtgct 
gggtgccagc 
ccatctcctt 
acgtttgtca 
cttaacctgc 
gacatgggtg 
gagggccagg 
ggatcacttg 
caaaaagtga 
cccagcactt 
ggctaacacg 
ggctcccgcc 
agatcgagac 
ggagaccatc 
cgggcatggt 
cgtgaacccg 
gcctgggtga 
aattagccag 
agaatggcat 
cagcctgggc 
gcctgtagtc 
aaggctacag 
ctgtctcaaa 
gcacagtaga 
ttccggcatc 
gaggaataag 
gagttctcat 
cctctgtttg 
ctgctcccac 
cctcgtgctc 
ggggtgatgg 
aatcacagtc 
tcttttctac 
tctcacaaat 
gagatgacat 
ggcttttcca 
cttattttat 
gtttttgtgt 
ctctgtcact 
cgggttcaag 
accataccca 



agagtgagac 
tatcccaaaa 
ctgggtgggg 
ctccctccaa 
tggatagaat 
tttattttat 
tttgtcgccc 
aggttcaagt 
ccataccagg 
gctggtcttc 
attacaggca 
gtgtcccacg 
gattccccca 
tcctctgggg 
gttccacact 
tgtgtctgcg 
ttgggtttag 
aaagaccctt 
tatctttctg 
tgtggtagtt 
agcccaggag 
aaaaaaaaaa 
tgggaggctg 
gtgaaacccc 
tgtaatccca 
catcttggct 
ctggctaaca 
agcgggcgcc 
ggaggcggag 
cagagtgaga 
gcacagtggc 
gaacccggga 
gatagagcaa 
ccagctactc 
tgagccatga 
gtaagtaaat 
gactggtgca 
catttgtgtg 
tgggaggatc 
ttggccccta 
tcttgttttg 
atctgtattg 
ctgcagaacc 
acaggtacct 
cctgtggcat 
tatgtgcatt 
tttgttacat 
ctttggctca 
gctgtctctc 
atgatttctg 
ctttttgttt 
caggctggag 
tgatcctctt 
gctaattttt 



tccgtctcaa 
aaaccgaaca 
cagggaacag 
gctgcctcct 
aacgcgagtt 
tttatttatt 
aggctggagt 
gattctccta 
ctaatttttt 
aactcctagc 
tgagccacca 
gctgctgtaa 
atgtgctgga 
gctccaggga 
tgtggatgcg 
tctttacctc 
ggcccaccca 
tttccaaaaa 

gggggcacta 

cacgcctgta 
tttacaagta 
aaaaaaaaaa 
aggcaggcag 
gtctctacta 
gcactttggg 
aacacggtga 
cggtgaaacc 
tgtagtccca 
cttgcggtga 
ctccgtctca 
aggtgcctat 
ggtggagttt 
gactctgtct 
aagaggctga 
tcgcaccact 
aggaggagag 
gacaggctgg 
tcaatttccg 
ctcccagaga 
ccctccaggc 
tctttgcagc 
ccaagggtga 
tgctgacctg 
gattctctgc 
ccagtttcca 
cggtgctatg 
tttcattttc 
tgtgttattt 
tgcaactgat 
ttattttaaa 
tgttttgctt 
tgcaatggcg 
gcctcagcct 
gtatttttag 



aaaaaaaaaa 
ctgaatcctt 
agcgtcttct 
cttcttctca 
cccagggact 
tatttattta 
gcagtggcgc 
cctcagcctt 
tgtattttta 
ctcaggtgat 
cgcctggcac 
caaatgacca 
gaccagaagc 
ggatctattt 
ccgcctcaac 
ttcttcttgt 
aatcatccga 
aaggttatgc 
tccaacccct 
atctcagcac 
gctaggcaac 
aagccaggtg 
atcacgaggt 
aaaatacaaa 
aggccgaggt 
aaccccatca 
ccgtctctac 
gctgctcggg 
gccgagatcg 
aaaaaaaaaa 
tgtcccagct 
gcagtgagcc 
caaaaaaaaa 
ggcaggaggg 
gccctccagc 
acaagtgggc 
cctgtgatgt 
tgtcagtagg 
ggccgggcct 
tgtggttctg 
agcccaccca 
tgggacacca 
tggaggtagg 
ctgctaggct 
aagctgattg 
aattttcctc 
aggtagtttg 
agaagtgttg 
ttctaattta 
tgtgttgggt 
cgtttgtttt 
cgatctcagc 
cctgagtagc 
tagagacggg 



45240 
45300 
45360 
45420 
45480 
45540 
45600 
45660 
45720 
45780 
45840 
45900 
45960 
46020 
46080 
46140 
46200 
46260 
46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
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gtttcaccat 

cctcccaaag 

atcacccagc 

ggattctgct 

agtgctcatc 

tgacaggggt 

tatctttttc 

tggtgggaga 

atcctggctc 

ctccacttcc 

gtaacagggc 

catgggtgtg 

cagggctttc 

gctctgctca 

cagcagcaca 

tctcgctttt 

acctcctggg 

cccaccacca 

gccaggatgg 

cagggattac 

tgttcctgga 

ctcacacctc 

cttcccagca 

ggtgtctgtg 

aacgctgcac 

accccctggc 

ctcgagagac 

ggagatcaca 

aggagccagg 

ttcagccctt 

tcacaggagc 

gggaggctga 

gtatgcccca 

cacggcttgc 

tcctcctgtg 

tctttgatca 

gtcatgggga 

gaaaccgatg 

agtgaggcct 

ttttctgcac 

gcctcctctg 

gggactctgc 

ccgaggggca 

cctctccctc 

tttgtcccgt 

ttgctatgcg 

ccgcaacagg 

agtgggcact 

agtcagggct 

ttaaagctct 

atgtcttctt 

ggagcggcac 

gggatgccaa 

tgcctgggtt 



gttggtcagg 

tgctgggatt 

atcatgcagt 

gttgttgggt 

aggttgtctc 

gtggag'tctc 

tctccttcta 

gtggggtagt 

tgtcacttcc 

tcctctgtga 

actccacctg 

gcagtagtaa 

tctcctatgc 

gcgacctttc 

cactggctgt 

ttcgcgcagg 

ttcaagtgat 

cgcccggcta 

tcttgaactc 

aggcgtgagc 

gagactggct 

gagtttggcc 

gcttttctcc 

cacctcccac 

taactgcagc 

ctgggccctg 

gtgtatttac 

gaaggacagg 

ctgggcaaga 

ccctgagccg 

ttgtggtgcc 

gaggctgctc 

tgtcgccctc 

agacatctgt 

gctgagccca 

ggccccctcc 

aggcctgcgg 

ttttttacct 

gggccgattt 

aaagtgggag 

gcttctctgt 

tattccctga 

ggatggcctc 

ctccctttca 

cttctgcagg 

cacgctggtc 

tgtggctgct 

caggcgtggc 

gaatttccca 

gattcccagt 

gcagtgaatg 

acaggagccg 

acccgcgctg 

ttgctgggcg 



ctggtctcga 
ataggcgtga 
ttatcttggt 
ggagtgttcc 
tatccttcct 
caactctaat 
cccttgatcc 
gaagaacctg 
cagttgagtg 
aattgagagc 
gaccctgaca 
taaagtgacc 
ccccatgcct 
tcccgtggga 
gcaccctttt 
ctgaagtgca 
tttcctgcct 
atttttgtat 
ctgacctcag 
caccacaccc 
cctgcccagg 
gtaagcagag 
aagttacagc 
tgatgcccga 
cttgggcctc 
ggccctcctg 
cgcacaggtg 
gcctcactga 
ggcggccaca 
ggcgcgcccc 
cagcggctgc 
ggtttcacaa 
aacctccagc 
cgttggctcc 
gtggggacag 
cctatgccac 
gttccaaaca 
tgatcagatt 
cccagcatcc 

ggaggagtcc 

gccagtgccc 
gtgctgccgc 
catggtcaca 
gcgccctccc 
cgcctgggac 
accacagagg 
gaccacctgc 
tgagcagatg 
cccggagcat 
tggcaccctt 
tgtattctcc 
cagcgatgga 
agtccctctc 
gggctgcgtg 



actcctgacc 

gccactgtgc 

gaatgttctg 

agaaacatca 

tcctgactgc 

ggtggatttg 

tcttctcccc 

gactttaggg 

accctggctg 

acttacctgg 

cgtgatgcac 

atctgtatcc 

ccaggtggcc 

gttcctgggg 

tttttttttt 

gtggtgtgat 

caccctccca 

tttcagtaga 

gtgatccgcc 

ggagtgccgg 

agctcgggga 

gggacatttt 

ccaaaagctc 

actgccctgg 

catgccagag 

gctcgggaac 

cttgtcattc 

ggtctcggac 

gtcacctcag 

tggctctggc 

ttctgattga 

ctgctgaggg 

caccggtcct 

tctcagccgt 

ctgcttcctt 

acagtccctg 

tccaaaggct 

tcagcttggc 

cctcctgagg 

taggaaatgg 

ccacgctttg 

atgcccagcc 

cgtaggaagt 

cgggggtgtc 

gtgctgacag 

cctggccctt 

tttctgcccc 

tgtgttgccg 

ttctgctgta 

tcccttctgc 

cagcctctct 

ggatgtgcgg 

aacttctgct 

tgatgttctc 



tcgtgatccg 

ctggccatta 

tgtactcttg 

attagatcca 

ctgcttgagc 

tttatttctc 

ctagggcttc 

ccaaagaggc 

gtgcctgaat 

caggctgtca 

aggaatgcca 

tcaccacagt 

ttggatcctg 

gttcagcttc 

tttttttttt 

cttggctcac 

agtagctggg 

gatggtgttt 

cacctcagcc 

ttgtttttag 

gtagggccgc 

gtgactgtcc 

aggtggattt 

ccaagaaacg 

gccatgccct 

tccaggcccc 

tcttgtggcc 

atggaccctt 

cagtgccatc 

cccagtgtcc 

gagtcgaggt 

agacttgggc 

ccgtgtcccc 

cgtgggctga 

ttattaccct 

tcactcgggt 

tgcgtgcagc 

gggggctttg 

ccagcctctg 

ggggccacct 

tgtctgtgtc 

cgcactgagg 

ggcctccacc 

ctcagccctc 

gtcctctgcc 

cttctgtagc 

tctggtcctg 

ggaggaggaa 

tttggtgtag 

attgaaaaac 

tctgggttgg 

gtgcagcacc 

ttgaagccca 

ctctgtccct 



cccacctcgg 
ggtgtgtttt 
aaaagaatgt 
gttggttaat 
tgtcagttat 
ctagtagttc 
ctggtgttag 
cagggttcaa 
ctctgtgagc 
tgggcatcaa 
gctgctatgc 
gaagcctgtc 
ttggttctgt 
atcctacaga 
tgagatggag 
tgcaacctct 
attacaggct 
caccatgttg 
tcccaaagtg 
cagtttgtct 

ggggtgctgc 

ccctcctgag 
gcaacccaac 
gggccgtcag 
tccatccacc 
ttcctcacgg 
tcttctccag 
tgatagtggc 
accaccgcca 
cagttacagc 
cggaggcttt 
tccatctcag 
catggccagg 
ccctggcacg 
agaactctcg 
gagcccagta 
atgacagctt 
ctcagctttc 
tttcctgtga 
cgaagcctag 
cccagcccat 
acgtggagcc 
ctccgatgat 
ctgcctgtgc 
ggctcctgcc 
agtcccacac 
aggagggcgc 
ggactgctcc 
cgcctgctgc 
atacggatgc 
ggctggaggt 
ccgtacagca 
gtcacgccat 
cccccagagc 



48480 

48540 

48600 

48660 

48720 

48780 

48840 

48900 

48960 

49020 

49080 

49140 

49200 

49260 

49320 

49380 

49440 

49500 

49560 

49620 

49680 

49740 

49800 

49860 

49920 

49980 

50040 

50100 

50160 

50220 

50280 

50340 

50400 

50460 

50520 

50580 

50640 

50700 

50760 

50820 

50880 

50940 

51000 

51060 

51120 

51180 

51240 

51300 

51360 

51420 

51480 

51540 

51600 

51660 
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cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 51720 

gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 51780 

ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 51840 

gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 51900 

ccctccccgt caaggctctg ccaagaccct ggccctgccc tccgggatac gagcttgggg 51960 

ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 52020 

agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 52 080 

gtagtgtggc ttgaaggcag actgtgaggg tctatctcac cttcctgcat taggtcagga 5214 0 

gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctcacaccc 52200 

tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 522 60 

taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 52320 

catcctgggc cctctctgac cagcaggatc ttcctttaga tcttgacagt gaaacacatc 523 8 0 

tcttctgtgc cccctgtgag ttttctttca ttcattcatt cattcattca ttcattcatt 52440 

cattcattcg agacagagtc ttgctctgtc acccaggctg gagtgccctg gtgtaatctc 52500 

ggctcactgc aacctctgcc tccagggttc aatcgattct cctgcctcag cctcccgagt 52560 

agctgggatg acaggtgcgc accaccatgc ctggctaatt tttgtatttt tagtagagac 52 620 

agggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg atccgcccgc 52680 

ctcagcctcc caaagtgctg ggattacagg catgagccac cgcgcccggc ctgagttttc 52740 

cttttatgaa ggacctgctt ggttggttgc ctgccacatg ttgtcagcac catgggccca 52800 

ggactgctga ggagctgttg atgccctcgc tctcccagag ccaccggctc tgttagataa 52 860 

ttcacatgca gtctggccac tgtcctacgt cctcattcac aaagagcaga catttcgtag 52 92 0 

aagatgaggg cctgggagta acctccctgc atgtttttct ataaaggcat agtggttaag 52980 

tccttccagc tcattgacca ttggagaatt ttatggaggc tgtagactag gggctggtaa 53 04 0 

actaagggcc caggggccaa atccagcctg ccacctactt ttgtaaataa agttttcttg 53100 

gtgcacagcc atgcccattc attcatttgc acaatgtctg tggctgcttt catgccaaaa 53160 

gcaagagaac tgagtggtta tgctggagac ctacggcctt caaagcccca gacctcacgt 5322 0 

ctggcccttg acagacagag cttccccagc cctgctgcgc atcctggccc agcatgtgct 53280 

gtgtgtgtga tttcagcttg caggagccgt ggttaggaat tgtccctgtg ttggtccatt 53340 

ttgcattgct atgaaggagc acctgaggcc gggtagatta tgaaggaaag aggtctgtct 53400 

ggctcatggt tctgtaggca gcaccagtat ggcacccgca tctgctcagc ttctagtgag 53460 

gtctcaggaa gctttgactc atggtgaaag tcgaagcggg agcaggtgca tcacatggtg 5 3520 

agagagggag caacggagag agagagagag cgcctctccc tcttgccctc accttgagag 53580 

gagatgccag gctcctttaa gtaaccagct cccatgtgaa ctcacagtga gagcccattt 5 3640 

gctactgcgg agagggcacc aggcatctgc tcccatgacc caaacactgc ccaccaggcc 53 700 

ctacctccaa ccttggggtc atattttatt ctgttctatg ctatgctatg ctatgccatg 53 760 

ccatgccatg ccatgctatt cctattctat tatttgagac agaatctcgc tctgttgccc 53 82 0 

aggctggagt gcagtggcat gatcttggct cactgcaacc tccacctccc aggttcaagc 53880 

gattctcctg tatcagcctc ccgagtagct gggattacag gcacacacca ccacacccgg 53940 

ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 54000 

ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 54060 

agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 54120 

ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 54180 

gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 54240 

gttaccagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 543 00 

ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 54360 

gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 5442 0 

cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 544 80 

cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 54540 

gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 54600 

acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 54660 

tagacctccc cacagtaggc acctgacaca tacctcctgg ggggcaggca ggaggtgcgt 5472 0 

tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 54780 

gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 54 84 0 

tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 54900 
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caccctccag 
ttccggtgtg 
tgtatcgacg 
cgtccgggct 
ggggctgggg 
agcctttccc 
gagtctgcct 
cgggaccttg 
tgttccctgg 
catgtgcaca 
cagtcacgga 
aggcagtggc 
aagtagcaat 
ggctgagagc 
ggcttcctca 
ctgaaggcag 
gacgggcctg 
ccgccagagc 
ccggccctgt 
tccctatggg 
ctcccgtggc 
ctgggctggc 
ataccaggct 
gtcagaggag 
caggctcccc 
aaagccagcc 
ttaatgttgg 
gacagcccgg 
gtcatgggtg 
aacgggccct 
gccccgggcg 
ctggaggagg 
ccaggccctc 
actgagcctg 
ccaccctttg 
aggacatgtg 
ctgggctctc 
gcagctggga 
aggctgtgtt 
tcacatgggt 
cccacagctc 
ccctgtcctg 
gtatagatgt 
aggagcccac 
ctgttcctgg 
tggaggaggt 
gtggccagag 
tggtctgagt 
tggctgagcc 
gaaagtccat 
accggaacca 
acccgccggt 
ccaccgtcag 
tgaggcatgc 



cctctctgag 
cgagcggcca 
gctccgacga 
gggttccccc 
gctgtgtggg 
agggcagcgg 
cagtgggcgg 
ctgagcagcg 
cagcttactg 
cacatgtaca 
tatagggcct 
tcctgaaagg 
gagtcattgt 
tcctctctta 
agggactccc 
tggtgggggc 
tgccgtgggc 
ccgcacgctt 
gggagagtga 
cagccttgat 
tggtgggcgg 
cagggcgcgc 
gccttggcag 
gaggagggtg 
aggcctagcc 
ccttcagggc 
ccacctcttt 
cccacagcag 
gtgtctattt 
tcccgcacga 
gttcccagca 
caggagagta 
ttgcacatgt 
tgcttggcat 
gcactgtagg 
ctgtgttgtc 
tccagagctg 
tttgtggctg 
tgtagatggc 
gaccatggta 
caggccacct 
tcccactgtc 
aacctgattc 
agtggtggca 
cgtgctctgt 
accatgtgtc 
aggcctgcgg 
ctcgtgggta 
tggaagccac 
gatgagctcc 
cgtcacaggg 
gaggggcggg 
tgctggccac 
ttgctttctt 



tgcatggcct 
gtgtgtcctc 
gctcatgtgt 
aggaacgtgg 
agactcaggc 
ccaggaggac 
ggctgatggt 
tctgtcaggc 
tctggctgag 
tgacaacatc 

aggggatagg 

ctgtctgatt 
gtctggggac 
tggctgcagt 
tgtgcccttt 
caccaagggt 
cccagctgtg 
ccattccgct 
atgtggcttt 
agtggagtgg 
caccaggggc 
tcacttctgc 
ggatgtgttg 
caccgtgcag 
tcccagctcc 
gcacggtgtc 
ctgtttgtct 
tgccatcggg 
tgtgtgccag 
gtatgtcagc 
tggccccttc 
gtctgggcag 
ggcagacatt 
cagaatcctt 
tgtggagaaa 
cacaaatcct 
aaggaatcac 
agcagttaca 
tgggcagccg 
tgtctaagaa 
tgaggattgc 
accccaagcc 
tctctgggga 
ccatcacagt 
gctgccccgc 
tcccgtttca 
ctcagggagc 

gtgggagcag 
ctgacctccc 
gtgagcctga 
gcctcgtcca 
gccggggagg 
cggaggcttc 
cccttttcaa 



ctccttgcag 
atcaaacagc 
ggtgagccag 
agtttagggg 
ggctgggagg 
agactgtgag 
gtccaggtgc 
agcaagatta 
gaggagtgat 
ccacatgctc 
aggccaagac 
caggcaggaa 
atggccactc 
cgggagagaa 
ggcaccttcg 
cgcctcctct 

ggggcgctgt 

gacttcatcg 
tgccaaagtt 
cccaaggagc 
tgcctggctt 
caccaccgag 
agggctgtgg 
gctgaagggc 
cccactttct 
tgcccccaag 
ctggcagaaa 
cccgtcattg 
cgcgtggtgt 
gggaccccgc 
acaggtaagg 
ctttggggag 
gctaatcgat 
caacacagag 
gctccttgga 
catgtaccct 
tctgtaccat 
attcctacgt 
caccgcagag 

ggtggagtcc 

ctctgcctgc 
ggcctcattg 
gcggggttat 
cgcagcagcc 
ctgggttccc 
cagatgagcc 

ggggcacatt 

aggagagcgc 
ccgtcccttc 
tggggggccg 
gcagctcgtc 
ggcggggcgg 
ccgggttcct 
acccttctgc 



ccatctgcct 
agtgcgactc 
cttctggcac 
aggagacgtg 
ctccttgcgg 
ctgtgggctc 
ctgcagcacg 
cccgagggct 
gttcacatat 
ctcaaatagc 
agtcagggaa 
gggagctgag 
cttcgctgca 
gtctgttggg 
tgccaggtca 
gctgggcaag 
tgatgcgcag 
acgccctcag 
gagtctggag 
ccacccagcc 
tgctcgttca 
ggccctgggc 
ggagtcggac 
cacgttaccc 
ccccaccctc 
gagggcccat 
tcaccaagcc 
gcatcatcct 
gccagcgcta 
acgtgcccct 
agcctgagat 
tggagcaggg 
cacagcattc 
gcctgcatgg 
cttgaccttc 
agaaatgaat 
acagcagctt 
ggcccaggca 
ctgcaccatg 
ctgtgaggtc 
ccagccctga 
ggagcctgtt 
ctggcttctc 
cccagagaac 
tgccccagtc 
ccggggagct 
tccaacagga 
cctatgtctg 
cctgccaggc 
gggcggggtg 
cagcacgaag 
gatggggctg 
gggggctgtg 
ttccttcttt 



gcccaaccag 
cttccccgac 
ggggaagggg 
cctttccagc 
gaggcaggga 
ggcggctaca 
cacccaccca 
gcagtggtcc 
gcacacatgt 
atgacctgta 
gactttccag 
ttcagatagg 

gagggacctg 

gggagaaggg 
ggcttgaggc 
ttcccagtct 
ccaggcctcg 
gatcgctggg 
cctggaaact 
gaccctgccc 
ccaacatcac 
gaaggagtga 
agcggcgggg 
tgaggttggc 
caccagtggc 
tccgttgggg 
gccctcagac 
ctctctcttc 
tgcgggggcc 
caatttcata 
atggaatgat 
atgtgctacc 
agcctttccc 
ctgtagcaac 
atattctagt 
gtgggggcgg 
tgtcttgagt 
ccaggaacgc 
ctggtttgta 
tgcaggtgcc 
gttccctctc 
ggatggcagg 
aagagctcct 
gcggccctgt 
gcaggcccct 
cactctagta 
cacaccgccc 
tggggcggct 
atcgcatgcg 
cccctctacg 
gccacgctgt 
tgggcccctc 
ccaccgcctc 
aatgacattg 



54960 
55020 
55080 
55140 
55200 
55260 
55320 
55380 
55440 
55500 
55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 
58140 
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ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 58200 

ccacctccta gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttccgca 582 60 

cggcccagcc catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 5832 0 

cagggttccc tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 583 8 0 

agctggatgg ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 5844 0 

cctggagctt ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 58500 

actgtgagga gaataaactg ccgctgccct ccaggatcac tggggctggc tggggagaac 58560 

ccccgtttct gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 58620 

tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 58680 

gcagctgtat gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 58740 

tgagccccac cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 58800 

gtcttgtctt cttgttttct ttttcttttt tttttgagac ggagtttcac ttttgttgcc 58860 

caggcttcag tgcaatggtg tgatctctag gctcaccgca acctctgcct cccaggttca 5892 0 

agcagttccc ctgcctcagc ctccctagta gctgggatta caggcatgtg ccaccacgcc 58980 

cagctaattt tgtattttta gtagagatgg ggtttctcca tgttggtcag gctgatcttg 59040 

aactccctac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 59100 

tgagccaccg cacctggcct ttttcttttc ttttcttttc ttttttctga gacagggtct 59160 

cgctctgtca cccaggctgg agtgcaatgg tgtcatcatg gctaactgca gcctctacct 5 9220 

tctaggctca agcaatcctc ccatctcagc ccctaagtag ctaggactgc acgcatgcat 59280 

ccccatgccc agctaatatt tacatttttt gtagagatga agtttcacta tattgcccag 5 934 0 

gctggtctcc aactcctgga ctcgagcgat cctcctgcct cggcctcccc aggtgctggg 59400 

attacaggcg tgagccaccg tgcctggcct ggggtattgt cttcttatgg cacctgactg 594 60 

tggtgggccc tgggaaggaa gtagcagaag agggttcttc ttggtttcct ggacagtaac 5952 0 

tgagtgttct ggaggcccca gggcctggct ttgtttaggg acaaagggaa ctggtaacca 5 9580 

gaagccgaga gtttaaacac ccactgccct tcttccctgc tcctgctgct gcaacccagc 59640 

ttaaccagcc aggagtgcta ggaacccaag cagggccccc gagcacacag caggcagctc 59700 

acgaattctc ttttcctgtt ctcccttggg agctgggagg atcttaatca ggcaataaga 59760 

gatggcactg agcagccagc taatttttta aatcacttta ttgtttaacc atatgactca 5982 0 

cccacttaaa aaagggtaca gttcagtggg ttttagtgta ttcacagatg tgtgcaaccc 59880 

tcaccacagt taattttaga acattttcct gcccctaaaa gaaactctgc atgaagccag 5 994 0 

ctgtttttaa att-agcaaag ttattttgca tcctttaaat atatgttcat ggtacaaaat 60000 

tcaaaagata cagaagagtc tgcagtccaa agagactccg cccccatgac gccaagcagg 60060 

catccctggg aggcatggcc tcctgcagtg tgtttcttct atgtcccccc aggggtcatc 6012 0 

tgtacatatg caagcataca agagcgtgga ctttgttttc caagccagaa gataattgta 60180 

gatttatgtg cagttgtgag aaagagcaca gacccattta tcctctgcct ggtttccccc 6024 0 

agtgctgcct gccatcttgc atgacttcca ttcctatcat aagcaagaca ctgataacga 6030 0 

ttctttcacc ttattcagat tgacataagt gttttttgtt tgttcttgag acaaacttcc 60360 

tctgtcaccc agtgggagtg cagtggcaca atcacagctc actgcagcct caaactcctg 6042 0 

ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 6048 0 

tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 6054 0 

tagtcttgaa ctcctgagct aaagtgatcc tcctgcctca gcctcccaaa gtggtaggat 6060 0 

tacaggcatg agccactgcg cctgggctga catatgtgtt ttcgtaagcc cgaaagatag 60660 

catctgaaga gtcaacattg agccttgcct tttgctgcta acgatgtata aaagctgctg 6072 0 

ttctgagcat ttcggaggct cccagctgcc gtgtgcaccc tgcctagagc tctaccgtaa 607 8 0 

cccatctccg ggaggaggtg ctattgtttt cctcattttg caacaaggag gctgaagaac 6084 0 

tgagcatgaa ccactggcct gggtcgttcg gttggtaggc agtggggcca ggccatccaa 60900 

ctcacaacca ccttctactc tgcttccccc gcaccctgaa gtttgttctg ttttgaggac 60960 

acagccgtca cattcttggt ggctgaacag cactccttgt caggcgtggc tgggccccca 6102 0 

ctggagggca tcatggtcct ctctcctgct gcggttgaac cttggctgtt tcaaccactc 61080 

ctgccaagtg gccctctgaa agggacagtc catcttttct cagcagaggg ccacactggc 6114 0 

aaaacggtcc ctggcaccct ttctctccac ctgtctaata tagagtaaaa atggtatcat 6120 0 

gttaagatct tcatttatat ttattttatc atgaatgatg taagcatcat tttgtgtgtt 61260 

taagaacctt tgggcccagc gtgatggctt gcagctgtaa tctcagcact ttaggaggct 6132 0 

gagatgagcg gatcacttga ggccgggagt ttgagaccag cctggccaac atggagaaac 613 8 0 
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cccgtctcta gtaaaaattt aaaaattagc cgggtatggt gatcccagct acttgggagt 6144 0 

ctgaagcatg agaattgctt gaacatggga ggcggaggtt gcagtgagcc gagatcgcgc 6150 0 

cattgcactc cagcctgggc gacagagcga gactctgtct caaaaaaaaa aaaaaaaaag 61560 

aaaagaaaag aaattatcaa tctcctcttt tatggcatat atatatatat atatatatat 6162 0 

atatatatat atatatattt tttttttttg gttatgttca gaaaggcctt ccctgctctg 61680 

atcataaaaa acaacttatt ttcacactct ctctcttttt tttttgagac agagttttgc 61740 

tcctgttgcc caggctggag tgcagtggcg caatctcagc tcactgtaac ctccgcctcc 61800 

cgggttggag tgattctcct gccttacctt cccgagtagc tgggattata ggcatgcacc 61860 

accatgcctg gctaattttg tacttttagt agagacgggg gtttctccat gttggtcagg 6192 0 

ctggtctcga actcgcgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 61980 

ttacagacgt gagccaccat gcccagccca cactctcttt cttaacgtcc tcctcctttc 62040 

gttttacgtt cacatcttta attcttctgg gatgtaatta gatttgatga gcaaggtggg 62100 

catccagctt gtttcttggc tgatggctta tgggtggcgt gaattagtcg gggtctatca 62160 

ggaggcagaa actctatgag aatttgaaca gagaaagttc cgtctacagg cttattacca 62220 

gggactggaa tagcagaaat tgaacagtga gatgtacaga gaactctaag aatgcaggaa 62280 

taggccaggc atggtggctc acacctgtca tcccagcact ttgggagacc aaggcgggtg 6234 0 

gatcacctga ggtcaggagt tcgagaccag cctggccaac atagtgaaac cccatctcta 6240 0 

ctaaaaatac aaaaaaatta gctgggtgtg gtggcgcatg cctgtaatcc cagctactcg 62460 

9<? a ggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccgagat 6252 0 

catgccactg tactccagcc tgggtggaag agcggaactc tgtctgaaaa aaaaaaaaaa 6258 0 

aacaagaagt tcaacttgaa gggaaaaatg ccgtattgtc tttccctttg ttatgtcacc 6264 0 

agggcacagt ccatcccagg ctggcgctga tccacgggct ggagaggggc tgccccagaa 62700 

gaggacatgc caggaagggc ttggctggtg ttcaggagcc caggccaggt caggtcaaga 62760 

ggtgttgagg ctggacggga gaggccagct aggggctcat gtaggatatg aggggtcggc 62820 

ccatttcaac gtggaaactg agctcttctg cttctctttc ttcttcactg cattaagatt 62880 

caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 6294 0 

ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 630 00 

aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060 

cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63120 

gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63180 

gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240 

gtggcctccc cggcagtgga aatagccaaa ctggcaacat ggactttctt caactgcccg 63300 

ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63 3 60 

ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420 

gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 63480 

agtggctgtc actgcagagg tgagagggtt tggagagaag gtacatgcct ttttggccac 63540 

attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63 600 

atgcaaaaaa aaatgttggc tttgcttctt tttacccaaa cctcagttcc ctcaagtaga 63660 

ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63 72 0 

tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780 

tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63 84 0 

ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 63900 

actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63 960 

gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 6402 0 

tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 64080 

tcaccacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 6414 0 

gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 64200 

tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 64260 

atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 6432 0 

tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 64380 

actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 64440 

ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 64500 

gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 64560 

gcccagccta ggggtacatg aaactttttt tttttttttt ttgagacaga gtttcactct 6462 0 
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gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680 

caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 64 740 

cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 64 8 00 

tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64860 

ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 6492 0 

tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64980 

ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040 

gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 6510 0 

tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160 

gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220 

accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 652 80 

caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340 

aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400 

ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460 

actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520 

tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 655 80 

ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640 

ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 657 00 

tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760 

agtgagccaa gattgcgcca ctgcactcca gtctgggtga cagagtaaga ttccatcccg 65 82 0 

aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 658 80 

cctcgcttgt ttcacttaac ataatgacct ccagttccat ccacgttgtt gctttgttat 65940 

aaatgacagg atcttggtca ggcgcagtgg ctcatgcctg taatcccagc actttgggag 66000 

gctgaggtgg actgatcatg aggtcaagag atcgagacca tcctggctaa cacagtgaaa 66060 

ccccgtctct actaaaaata caagaaatta gccgggcgtg gtggtgggca cccatttccg 6612 0 

ccccttctcg ggacgctgat gcacgacata ttacccatcc ccggaagact aatcctcccc 66180 

cactctatat tgtacctctt cctttctcct ccacgcgatt ccccgagtaa cccgtcttcc 66240 

ctccctcctc ggattacgct cacctttccg cttcaatcac gttgctccgt ccccttcccc 663 00 

attcgtacca ctcctcactt tcgtcttcct acccccacta tcccttttcg tcctctctat 66360 

tccttactta ctcctccccc ttctcttcat acttcattcc ctccgctctt cccactcgcg 66420 

ctcccacttt cacctagttg ccctcaccta cgttgccatc tcgccccttc ttcagctctc 66480 

ggcctctcac ccatctgtcc tctctcttac ctctctcctc atctcgctca gacatctctc 66540 

tagactatcc ctcactttac cttctcagtc gtcttcttcc tatccttcgt tctccatgat 66600 

cttcacgtcg ccatctcttt tcgccccttt catatgtctc tcttcatgtt ctcactatca 66660 

ttctcatgat cactatcgtt ctcactactt atcactcccc tctttcttca tcaattcctc 66720 

tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780 

ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840 

cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900 

tcctcttcta ttgcacatcc tcttctattg cac 66933 

<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 12 

ctgagcggaa ttcgtgagac c 21 

<210> 13 
<211> 23 
<212> DNA 



-96- 



BNSDOCIO: <WO 0209200OA2_l_> 





WO 02/092000 



PCT/US02/14877 



<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 13 

ttggtctcac gtattccgct cga 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 14 

ctcgagaatt ctggatcctc 

<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 15 

ttgaggatcc agaattctcg ag 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<40O> 16 

tgtatgcgaa ttcgctgcgc g 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 17 

ttcgcgcagc gaattcgcat aca 

<210> 18 
<211> 21 



-97- 



BNSDOCID: <WO. 



02092000A2J_> 





WO 02/092000 



PCT/US02/14877 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 19 

ttgtcactga gaattcagtg gac 23 

<210> 20 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 20 

gaatccgaat tcctggtcag c 21 

<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 21 

ttgctgacca ggaattcgga ttc 23 

<210> 22 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 22 

cuacuacuac uactgagcgg aattcgtgag acc 33 



<210> 23 
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<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 23 

cuacuacuac uactcgagaa ttctggatcc tc 32 

<210> 24 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 3 3 

<210> 25 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 25 

cuacuacuac uagtccactg aattctcagt gag 33 

<210> 26 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 26 

cuacuacuac uagaatccga attcctggtc age 33 

<210> 27 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 27 

aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 45 



.99- 



BNSDOCID: <WO 02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



<210> 28 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 28 

aattcggcac gag 13 

<210> 29 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 



<210> 30 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 30 

gtacgacggc cagt 14 

<210> 31 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 31 

aacagctatg accatg 16 

<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 



<400> 29 
ctcgtgccg 



9 



<400> 32 

ccaagttctg agaagtcc 



18 
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<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 33 

aatacctgaa accatacctg 20 

<210> 34 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 5 7 

<210> 35 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 35 

tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56 

<210> 36 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 36 

agctgctcgt agctgtctct ccctgga 27 

<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 37 

gccgtaatgc ggcacaggga ataagct 2 7 
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<210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 38 

gagaggctat atccctgggc 20 

<210> 39 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 39 

acagcacgtg tttaaagggg 2 0 

<210> 40 

<211> 163 

<212> DNA 

<213> Homo sapiens 

<400> 40 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60 

gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120 

gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163 

<210> 41 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60 

ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 120 

tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180 

agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 240 

tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 3 00 

agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 3 60 

aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419 

<210> 42 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 42 

ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60 
gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 12 0 
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ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 
caccgtgcca acctggacgg ctcgttccgg taggtaccca c 

<210> 43 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 43 

tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 
tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 
agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 
aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 

<210> 44 

<211> 156 

<212> DNA 

<213> Homo sapiens 

<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 
cggcaggacg tgtaaggcag gtgaggcggt gggacg 

<210> 45 

<211> 416 

<212> DNA 

<213> Homo sapiens 



180 
221 



60 
120 
180 
221 



60 

120 

156 



<400> 45 

ctccacagga 

gctggacacg 

tgccatcgac 

catccgcagg 

cgaccccgat 

cacggaccgc 

ggacctggac 



gccgaggagg 
ccggacttca 
tacgacccgc 
gcgtacctgg 
ggcatcgcgg 
atcgaggtga 
gagccccgag 



tgctgctgct 
ccgacatcgt 
tagagggcta 
acgggtctgg 
tcgactgggt 
cgcgcctcaa 
ccatcgcact 



ggcccggcgg 
gctgcaggtg 
tgtctactgg 
ggcgcagacg 
ggcccgaaac 
cggcacctcc 
gcaccccgtg 



acggacctac 
gacgacatcc 
acagatgacg 
ctggtcaaca 
ctctactgga 
cgcaagatcc 
atggggtaag 



ggaggatctc 
ggcacgccat 
aggtgcgggc 
ccgagatcaa 
ccgacacggg 
tggtgtcgga 
acgggc 



60 

120 

180 

240 

300 

360 

416 



<210> 46 

<211> 198 

<212> DNA 

<213> Homo sapiens 

<400> 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 
gaggtgaggc tcctgtgg 

<210> 47 

<211> 244 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
198 
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<400> 47 

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60 

tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120 

gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180 

tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240 

ggtc 244 

<210> 48 

<211> 313 

<212> DNA 

<213> Homo sapiens 

<400> 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60 

tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120 

tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180 

ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240 

cc tcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 3 00 

aggtagcgtg ggc 313 

<210> 49 

<211> 255 

<212> DNA 

<213> Homo sapiens 

<400> 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60 

agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 

gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180 

tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240 

aagtgtttgc ctgtc 255 

<210> 50 
<211> 210 
<212> DNA 

<213> Homo sapiens 
<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 
ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120 
accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180 
tcgtccaaca tgctgggtga gggccgggct 210 

<210> 51 
<211> 352 
<212> DNA 

<213> Homo sapiens 
<400> 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 
tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120 
ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 
acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 24 0 
ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 3 00 
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actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352 



<210> 52 

<211> 225 

<212> DNA 

<213> Homo sapiens 



<400> 52 

gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60 

atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120 

aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180 

atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225 



<210> 53 

<211> 235 

<212> DNA 

<213> Homo sapiens 



<400> 53 

ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 60 

gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 12 0 

caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180 

ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235 



<210> 54 

<211> 218 

<212> DNA 

<213> Homo sapiens 



<400> 54 

ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 60 

cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 120 

gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 180 

cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218 

<210> 55 

<211> 234 

<212> DNA 

<213> Homo sapiens 



<400> 55 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120 

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180 

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234 

<210> 56 

<211> 157 

<212> DNA 

<213> Homo sapiens 



<400> 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60 
ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 12 0 
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acctgctgac ctgtggaggt aggtgtgacc taggtgc 



157 



<210> 57 
<211> 272 
<212> DNA 

<213> Homo sapiens 
<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 
cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 12 0 
tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg 180 
gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240 
agacgaggtg gactgtgacg gtgaggccct cc 2 72 

<210> 58 
<211> 134 
<212> DNA 

<213> Homo sapiens 
<400> 58 

tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60 
catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 12 0 
tggtgagcca gctt 134 

<210> 59 
<211> 274 
<212> DNA 

<213> Homo sapiens 
<400> 59 

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60 
ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120 
tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180 
atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 24 0 
gccccttcac aggtaaggag cctgagatat ggaa 2 74 

<210> 60 
<211> 164 
<212> DNA 

<213> Homo sapiens 
<400> 60 

cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 
gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 120 
cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164 

<210> 61 
<211> 130 
<212> DNA 

<213> Homo sapiens 
<40O> 61 

ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60 
acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 12 0 
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tcccctgcag 



130 



<210> 62 

<211> 496 

<212> DNA 

<213> Homo sapiens 



<400> 62 

tcaaacattc 

ccgacgacgc 

gccagcaagt 

ccccacagcc 

agctacttcc 

ccgggccact 

aatatatttt 

aactgtgatg 

ttaattttgt 



cggccactgc 
cctgcagcac 
actacctgga 
agtacctgtc 
atctcttccc 
ctggcttctc 
atgatttaaa 
gggtgggcag 
aaaaca 



gagaccgtac 
cgacgtgtgt 
tttgaactcg 
ggcggaggac 
gccccctccg 
tgtgcccctg 
aaataaatat 
ggctgggaga 



aggccctaca 
gacagcgact 
gactcagacc 
agctgcccgc 
tccccctgca 
taaatagttt 
aattgggatt 
actttgtaca 



tcattcgagg 
acagcgccag 
cctatccacc 
cctcgcccgc 
cggactcatc 
taaatatgaa 
ttaaaaacat 
gtggagaaat 



aatggcgccc 
ccgctggaag 
cccacccacg 
caccgagagg 
ctgacctcgg 
caaagaaaaa 
gagaaatgtg 
atttataaac 



60 

120 

180 

240 

300 

360 

420 

480 

496 



<210> 63 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 63 

ttttgggtac acaattcagt eg 

<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 64 

aaaactgtgg gtgcttctgg 

<210> 65 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Artificial Sequence is a primer. 
<400> 65 

gtgattgagc caatcctgag a 

<210> 66 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 66 

tgagccaaat aaaccccttc t 21 

<210> 67 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 67 

ctggactacg tggccttctc 2 0 

<210> 68 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 68 

ttcagaagca cttggctgg 19 

<210> 69 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 69 

ctcagtgcca tgaagatgga 2 0 

<210> 70 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 70 

caagatcact cgatctccag g 21 

<210> 71 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 71 

gtttcaggag actcagagtc 2 0 

<210> 72 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 72 

ttctgcaggt tgctgttgag 

<210> 73 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 73 

ttattgtgat ttcccgtggc 

<210> 74 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 74 

gccctctgtc ctgacttcag g 

<210> 75 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gagaaagaaa taaggggacc 

<210> 76 

<211> 20 

<212> D3STA 

<213> Homo sapiens 

<400> 76 

tgctttgtaa agcactgaga 

<210> 77 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 77 

gaagtacggg cagttcagtg gcct 

<210> 78 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 78 

atacaccaag gtccatgttc cccgt 

<210> 79 
<211> 25 
<212> DNA 
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<213> Homo sapiens 



<400> 79 

agcctgggcc acagcgtgag actac 



25 



<210> 80 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 80 

tcccggagct tgcacacccg cttca 25 

<210> 81 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 82 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 82 

caagattctg tagcttctgg 2 0 

<210> 83 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 84 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 84 

atcctctcac atcccacact 2 0 

<210> 85 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 81 

catgtgccca cctcattcat 



20 



<400> 83 

cagagaagtc aagggacttg 



20 



<400> 85 

caaggctaaa agacgaaaaa 



20 



<210> 86 
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<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 86 

tcaggagcat ttcatctttt 

<210> 87 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 87 

aagtcgaggc tgcaaggag 

<210> 88 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 88 

gccctgtgtt cctttcagta 

<210> 89 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 89 

aaggtgtgag gatcactgg 

<210> 90 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 90 

agctcatggg ggctatt 

<210> 91 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 91 

gcttctccga gtgtatcaac 

<210> 92 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 92 

atggcagagg acttagaaca 
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<210> 93 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 93 

gatcagcgaa cttcctctcg gctc 24 

<210> 94 

<211> 24 

<212> DNA 

<213 > Homo sapiens 



<210> 95 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 95 

gctaatcaca gtctaaccga 20 

<210> 96 

<211> 19 

<212> DNA 

<213> Homo sapiens 



<210> 97 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 97 

gcacagctgt agtggggttc taggc 25 

<210> 98 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<210> 99 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 99 



<400> 94 

tccacattga ggactgtggg aacg 



24 



<400> 96 

ttgcactgtc ttggatgca 



19 



<400> 98 

caggcgcaaa ggacatgcac acggc 



25 
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caccgatgag tgcacgttca aggag 

<210> 100 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 100 

cagacagaga tgctccacgc catac 

<210> 101 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 101 

tttctgggtg tgtctgaat 

<210> 102 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 102 

acacagttgc tctaaagggt 

<210> 103 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 103 

catttgggaa atccagaaga 

<210> 104 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 104 

taggtgtctt attttttgtt gcttc 

<210> 105 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 105 

gacataccat gaacactata agagg 

<210> 106 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 106 

caacccatac cagggataag 



20 



<210> 107 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 107 

gaacaagagg ggtaagttgg c 21 

<210> 108 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 108 

tgaggacaca gatactgatg gg 22 

<210> 109 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 109 

gaagtgttcc ctcttaaatt ctttg 25 

<210> 110 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 110 

gaactatatt gtagttagtg aggag 25 

<210> 111 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 111 

cctgtaaccc ccagtccc . 18 

<210> 112 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 112 

tcttgcttcc taagtttctc gg 22 

<210> 113 
<211> 21 
<212> DNA 
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<213> Homo sapiens 



<400> 113 

actccatcca cctcatcact g 



21 



<210> 114 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 114 

tgctgtttgc ctcatctgac 2 0 

<210> 115 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 115 

gtggacaggc atagctgagg 2 0 

<210> 116 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 116 

tgttcactct tctgcctgca g 21 

<210> 117 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 117 

agctggactc tcacagaatg 2 0 

<210> 118 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 118 

caagaggctg gtagaaggtg 2 0 

<210> 119 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 119 

gactccagtc tgggcaataa aagc 2 4 

<210> 120 
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<211> 22 
<212> DNA 
<213> Homo 



sapiens 



<400> 120 
ggtggcagca 



tgacctctaa ag 



22 



<210> 121 
<211> 16 
<212> DNA 

<213> Homo sapiens 
<400> 121 

caggcccagt ctcttg 16 

<210> 122 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 122 

cgtgtccaga tgaaagtg 18 

<210> 123 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 123 

acctcacggt gtaatccc 18 

<210> 124 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 124 

cttgaagccc atctttgc 18 

<210> 125 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 125 

tatttgcaaa gcttgagact tct 2 3 

<210> 126 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 126 

aatcactgtg ctttgttgcc 20 
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<210> 127 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 127 

actttattgt cagcgtgggc 

<210> 128 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 128 

actccctcga tggcttcc 

<210> 129 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 129 

gagcagggga gagaaggc 

<210> 130 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 130 

cccactggct tgttttattg 

<210> 131 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 131 

agccacttta ttgttatttt gatgc 

<210> 132 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 132 

aagagtgaac aaaagcaaac atacc 

<210> 133 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 133 
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gtggagtgtg ggattggg 



18 



<210> 134 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 134 

tactgttctt gataagtatg tcggc 25 

<210> 135 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 135 

atgcttttgc atgattctaa ttatt 25 

<210> 136 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 136 

tcccccaaaa gaatgtaaag g 21 

<210> 137 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 137 

ctggtcttcc ttgtgtgctg 20 

<210> 138 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 138 

atcacccagg ccagggat 18 

<210> 139 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 139 

tcagaagcag aactgttttt aaca 24 

<210> 140 
<211> 22 
<212> DNA 

<213> Homo sapiens 



-118- 



BNSDOCID: <WO 02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



<400> 140 

cctgcttgaa agttctagag cc 

<210> 141 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 141 

caagcccggg ttttattgaa a 

<210> 142 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 142 

gatgccagga ccatggac 

<210> 143 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 143 

gcatatagaa acaatttatt gccg 

<210> 144 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 144 

ctctgaagca gggaccagag 

<210> 145 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 145 

ctaccacacc acaccaggc 

<210> 146 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 146 

caagcgaaag ctgccttc 

<210> 147 
<211> 23 
<212> DNA 
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<213> Homo sapiens 



<400> 147 

gttgtcttga cttcaggtct gtc 



23 



<210> 148 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 148 

ttttccttca acaatcacta ctcc 24 

<210> 149 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 149 

gcgtggggat atagaggtca 2 0 

<210> 150 

<211> 20 

<212> DMA 

<213> Homo sapiens 

<400> 150 

tacgtggcca agaagctagg 20 

<210> 151 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 151 

taatatatcc ccagtctaag gcat 24 

<210> 152 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 152 

agcttgcaga tggagccc 18 

<210> 153 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 153 

tggttttaaa cctttaatga gaaaa 2 5 

<210> 154 



-120- 



BNSDOCID: <WO 02092000 A2_l_> 




4 



WO 02/092000 



PCT/US02/14877 



<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 154 

tgttgatcta taccctgttt ccg 

<210> 155 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 155 

aattatttaa aagagaggaa aggca 

<210> 156 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 156 

tggctgtgaa cttcctctga 

<210> 157 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 157 

ggttacagaa aaacatttga gagat 

<210> 158 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 158 

tgagctttag ttcccttctc tg 

<210> 159 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 159 

ttgaaaaacc atttatttca ccg 

<210> 160 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 160 

tctgcggctg ttggattt 
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<210> 161 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 161 

ttgaaaaacc atttatttca ccg 

<210> 162 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 162 

tgttctcttc tcccagcagg 



23 



20 



<210> 163 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 163 

ctttattgaa aacattgagt gca 



23 



<210> 164 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 164 

ttgtcaaatt ccccccaaaa 

<210> 165 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 
<222> 12 

<223> n = A,T,C or G 



20 



<400> 165 
aaaccacgac cnccaa 



16 



<210> 166 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 166 

ccctggaaag gtaagatgct 



20 



<210> 167 
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<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 167 

cttttggtag agacaaggtc tea 

<210> 168 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 168 

tatctgtctg tagtgettea aatgt 

<210> 169 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 169 

gacgaaggtg attcagggc 

<210> 170 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 170 

actgaagaac tcttgtcct 

<210> 171 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 171 

cagataaaag agtcactatg gctca 

<210> 172 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 172 

cacttctccc actttgtccc 

<210> 173 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 173 

ttattgataa gcattagtga acccc 



25 



19 



19 



25 



20 



25 
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<210> 174 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 174 

tggcaagtta ggcacagtca 2 0 

<210> 175 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 175 

ctatgcccag agatgaacag g 21 

<210> 176 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 176 

tccactaagg gctatgtcgc 2 0 

<210> 177 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 177 

gccagcttta ttgagtaaac ttcc 24 

<210> 178 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 178 

cactggagac tacaagtggt gg 22 

<210> 179 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 179 

catcccaacc atcactcagt 20 

<210> 180 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 180 
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ggggactagc ttacagattt ga 

<210> 181 

<211> 23 

<212> DNA 

<213> Homo sapiens 



22 



<400> 181 

agactacatt ttggaaccag tgg 



23 



<210> 182 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 182 

tgaaaggata tttatagcct gga 

<210> 183 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 183 

gaaggttttg tccctcgatc 

<210> 184 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 184 

tgagggttgg gaagatcata 

<210> 185 

<211> 18 

<212> DNA 

<213> Homo sapiens 



23 



20 



20 



<400> 185 

ccttcatagc cacacccg 

<210> 186 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 186 

cagctaactg ttgacatgcc a 

<210> 187 

<211> 25 

<212> DNA 

<213> Homo sapiens 



18 



21 
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<400> 187 

tctttactgt gcttacaact ttcct 



25 



<210> 188 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 188 

caacagtgca gtcggtatcg 2 0 

<210> 189 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 189 

agatcagcaa gcagatag 18 

<210> 190 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 190 

cattccacat ggatagac 18 

<210> 191 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 191 

catacctatg aggtgtgcta cagg 24 

<210> 192 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 192 

gcattttctc atcatccttg c 21 

<210> 193 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 193 

ttacagccac caaggtttcc 2 0 

<210> 194 
<211> 20 
<212> DNA 
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<213> Homo sapiens 
<400> 194 

aggtgtgtgt gccaggttga 

<210> 195 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 195 

cactgttatc tcattaactg tgagg 

<210> 196 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 196 

tttgattttg tgtctcccaa a 

<210> 197 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 197 

ccccactccc acttttattt 

<210> 198 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 198 

ccagtcacct ttactagtcc tttg 

<210> 199 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 199 

aggacacagc ctgcatctag 

<210> 200 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 200 

accaggcatt gcactaaaag 
<210> 201 
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<211> 22 
<212> DNA 

<213> Homo sapiens 



<400> 201 

gatgggtcac actaacctgt ca 



22 



<210> 202 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 202 

acatttatat ttggacatgc aacc 24 

<210> 203 
<211> 22 
<212> DNA 

<213> Homo sapiens 



<210> 204 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 204 

atgtgctggg ctggaaag 18 

<210> 205 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 205 

tcacattcaa aaatcggcaa 2 0 

<210> 206 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<210> 207 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 207 

tgttttattt ctcagtacaa agcca 25 



<400> 203 

agcatcttta atgtgtcagg ca 



22 



<400> 206 

ctgcctgtgt ggtgtcgc 



18 
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<210> 208 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 208 

gacctcctgt gacaccacg 

<210> 209 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 209 

ccaccaaatt atttatagtt ctgcg 

<210> 210 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 210 

gtaagattct ccactgttgc acc 

<210> 211 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 211 

cctataatgg gctggaccaa 

<210> 212 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 212 

actcctcatg tgaagtcacc g 

<210> 213 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 213 

cagtgtgcac gttttcattt 

<210> 214 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 214 
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cagcatcttc agcacttacc 



20 



<210> 215 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 215 

ctgcatttat tatgagaatc aacag 25 

<210> 216 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 216 

tgctgctggg agtcagagtc 20 

<210> 217 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 217 

cagggcactg agatacactt acc 23 

<210> 218 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 218 

aaggatcaag ccaggcattt g 21 

<210> 219 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 219 

acacatctct tctgtgcccc 2 0 

<210> 220 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 220 

tgaaccctgg aggcagag 18 

<210> 221 
<211> 19 
<212> DNA 

<213> Homo sapiens 
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<400> 221 

cattccccag tttgcagac 



<210> 222 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 222 

gtgctgggat tacaggtgt 

<210> 223 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 223 

gcagagaagt cctgttagcc 

<210> 224 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 224 

ccatgctaga gaagcacaac 

<210> 225 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 225 

agtgtggggc aggacctctg 

<210> 226 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 226 

cagacagata gccctgggtt c 

<210> 227 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 227 

tccctcatcc ccttgtctgt 

<210> 228 
<211> 19 
<212> DNA 
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<213> Homo sapiens 



<400> 228 

agcccccctg gggataatc 



19 



<210> 229 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 229 

gatgcttacc taccacggc 19 

<210> 230 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 230 

aggattccta tctgggctat g 21 

<210> 231 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 231 

tggcagacca tgctccgcct 2 0 

<210> 232 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 232 

gagaaggccg ggaggctctg 20 

<210> 233 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 233 

ctccatcaca accagatttg aggct 25 

<210> 234 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 234 

gggt^tg^gc tgctgctgaa gg 22 
<210> 235 
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<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 235 

agtgggaaac ctcaggtagc tcccg 

<210> 236 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 236 

cagtttggct cagacatatg ggggc 

<210> 237 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 237 

cattagtagt ggggggacag 

<210> 238 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 238 

caaagcgaca gtgagttagg g 

<210> 239 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 239 

ggagtagacc atgattactg 

<210> 240 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 240 

catggtctat ttattctcg 

<210> 241 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 241 

cgccctggat cctcacacta ca 
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<210> 242 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 242 

gggcatcagg ggatgggtag a 21 

<210> 243 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 243 

gctcctatct gtgttttgaa tgg 23 

<210> 244 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 244 

ccgtggcata gataagtaaa eg 22 

<210> 245 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 245 

ettggagege tatgaggagg gc 22 

<210> 246 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 246 

atggcaactg accttccgtc ctg 23 

<210> 247 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 247 

ttggagtcac aggggc 16 

<210> 248 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 248 
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cagcactatc cttgggg 

<210> 249 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 249 

aacaaagctg cttagcacct g 

<210> 250 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 250 

gatgaggacc aactggtgac 

<210> 251 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 251 

ttttccaata atgtgacttc 

<210> 252 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 252 

caatcccaac cgtaacaggc 

<210> 253 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 253 

cttgatctcg cccaggaac 

<210> 254 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 254 

gctcgctgaa ggatgaagac 

<210> 255 

<211> 17 

<212> DNA 

<213> Homo sapiens 
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<400> 255 

gaatcgcttg aacccag 17 

<210> 256 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 256 

ccaggtggtc ttaacgg ' 17 

<210> 257 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_f eature 
<222> 8 

<223> n a A,T,C or G 
<400> 257 

gaacgttntt catgtaggcg t 21 

<210> 258 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 258 

taatggtcgc tgtccc 16 

<210> 259 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 259 

agggaaaatg gtatgtgggg ag 22 

<210> 260 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 260 

gcagtgtgtg aaggcagg 18 

<210> 261 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 261 

agtggacaaa atgaggaaaa cagg 24 
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<210> 262 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 262 

ccaacacagt ttgctcacat gcc 

<210> 263 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 263 

tgacatcttt gcattatggc 

<210> 264 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 264 

agttatccca cctgataccg 

<210> 265 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 265 

agctcttgct tctcagtcca 

<210> 266 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 266 

caaaagttgt ttctgtgttt gttc 

<210> 267 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 267 

gcctctcaaa gtagttggaa cc 

<210> 268 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 268 
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tgtgtatcca tagtgcaaaa cag 



23 



<210> 269 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 269 

ctcaaggcca ggcatcact 19 

<210> 270 
<211> 19 
<212> DNA 

<213> Homo sapiens 



<210> 271 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 271 

aatgatgatc tcaactctg 19 

<210> 272 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 272 

actgaagaac tcttgtcct 19 

<210> 273 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 273 

gacatctgtt agtctcataa ttc 23 

<210> 274 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 274 

ggtaacagtg tcttgctt 18 

<210> 275 
<211> 21 
<212> DNA 

<213> Homo sapiens 



<400> 270 

ggactcttcc atgccagtg 



19 
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<400> 275 

ctatgtacaa aacaggaaga g 21 

<210> 276 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 276 

atcctagttt cctctcctt 19 

<210> 277 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 277 

gtaaatgaga aacagacaaa tga 23 

<210> 278 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 278 

ctattggatg tgatatgtta tgg 2 3 

<210> 279 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 279 

aagtagaaac aaaatgaggg ac 22 

<210> 280 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 280 

cctaccccaa ggtaacag 18 

<210> 281 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 281 

acttcctata aatggaggtg ag 22 

<210> 282 
<211> 18 
<212> DNA 
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<213> Homo sapiens 



<400> 282 

gaggagcttc aagaggaa 



18 



<210> 283 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 283 

catactccta gactcaagga ate 23 

<210> 284 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 284 

gaatgatgta catgaattct ttg 23 

<210> 285 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 285 

gtgttgagga gaaaagcact 20 

<210> 286 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 286 

ctcccagtag tcacattcc 19 

<210> 287 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 287 

caagttacaa ataacttaag ccg 23 

<210> 288 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 288 

caagacccta tctctacaaa aac 23 
<210> 289 
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<211> 24 
<212> DNA 
<213> Homo 



sapiens 



<400> 289 
tttattagaa 



gtgactcttg gccc 



24 



<210> 290 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 290 

gactacctgc cctcagcttg 2 0 

<210> 291 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 291 

ttctcatgta caaagcggtc 2 0 

<210> 292 

<211> 20 

<212> DNA 

< 2 1 3 > Homo s ap i ens 

<400> 292 

ccactggctt ctctcttttt 20 

<210> 293 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 293 

caccagaagg ttggggtg 18 

<210> 294 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 294 

actattacga catgaacgcg g 21 

<210> 295 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 295 

ctcatgctgg atgacccc 18 
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<210> 296 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 296 

ttgcctttct tgaaacttaa ttcc 24 

<210> 297 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 297 

tcacagcctt cagtcaggg 19 

<210> 298 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 298 

acatgctgtg gcaccatg 18 

<210> 299 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 299 

cctgagctac tgccacag 18 

<210> 300 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 300 

ccctgacttg gacagtgtcc 20 

<210> 301 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 301 

tcagagtcac tcctgccc 18 

<210> 302 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 302 
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caaattcaag ctcatccaga cc 

<210> 303 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 303 

cggcatttca tccaggac 

<210> 304 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 304 

ggtgtaggag gtgcgacaat 

<210> 305 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 305 

ttccatttat tgagcacctg 

<210> 306 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 306 

cttaagccac tgtgttttgg 

<210> 307 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 307 

cctcctacac ctgcaaaagc 

<210> 308 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 308 

tggaagaacc ccagaggac 

<210> 309 

<211> 23 

<212> DNA 

<213> Homo sapiens 
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<400> 309 

aaagcacaaa agtaacagca aca 



23 



<210> 310 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 310 

gtgtgtgggc cacaatattg 2 0 

<210> 311 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 311 

agagcacctt tcctcagcac 20 

<210> 312 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 312 

agaatctcat cacaggggcg 2 0 

<210> 313 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 313 

aaaaaggaca gtgtctaaaa ttfcga 25 

<210> 314 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 314 

aattgttttt gtttgttttt tgagt 25 

<210> 315 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 315 

gatttaggga gtacaagtgc gg 22 

<210> 316 
<211> 25 
<212> DNA 
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<213> Homo sapiens 
<400> 316 

ggggacaaat tatactttat tcagg 

<210> 317 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 317 

ccatcatcat attggtgtga cc 

<210> 318 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 318 

tggctgccca agaagaag 

<210> 319 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<40O> 319 

ttaagatgcc attaaactca tgac 

<210> 320 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 320 

ccaaggagat gaccaagtgg 

<210> 321 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 321 

ccatctcttt tatcagggtt gg 

<210> 322 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 322 

ctctgtgcaa gtaagcatct taca 
<210> 323 
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<211> 20 
<212> DNA 
<213> Homo 



sapiens 



<400> 323 
cgactgtgta 



ttttccacag 



20 



<210> 324 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 324 

agaagcccat atcaatgcac 2 0 

<210> 325 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 325 

agcttaaagt aggacaacca tgg 2 3 

<210> 326 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 326 

ggatgcttca ctccagaaag 2 0 

<210> 327 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 327 

tgttgtttat ttccacctgc c 21 

<210> 328 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 328 

agagtggctg caggccag 18 

<210> 329 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 329 

tttttttttt tacacgaatt tgagg 25 
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<210> 330 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 330 

tgaggaagta aaaacaggtc ate 

<210> 331 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 331 

atgaaatctt aagcagaatc cca 

<210> 332 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 332 

cacagagtcc cagggtctgt 

<210> 333 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 333 

aaaggccttt atttatctct ctctg 

<210> 334 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 334 

gcctcagagc tggtgggt 

<210> 335 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 335 

gcttctaagt cttagagtca gctgg 

<210> 336 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 336 
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agcccacagt cagcctacc 



19 



<210> 337 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 337 

ttggttaaat gatgcccaga 2 0 

<210> 338 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 338 

tggtcccact cacatccc 18 

<210> 339 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 339 

acacagcatg cagggagag 19 

<210> 340 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 340 

atccctggtg cttaggtgg 19 

<210> 341 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 341 

gatggaagta gctcctctcg g 21 

<210> 342 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 342 

ggaaggccag caagtactac c 21 

<210> 343 

<211> 18 

<212> DNA 

<213> Homo sapiens 
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<400> 343 

ccggtgcttg gaaagatg 

<210> 344 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 344 

gaagtgtctc tgttggggga 

<210> 345 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 345 

ttacaggcat gagtcactac gc 

<210> 346 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 346 

accactctca cagcccttac a 

<210> 347 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 347 

ccctccctcc acacacac 

<210> 348 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 348 

gctcactgaa ctttcagggc 

<210> 349 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 349 

agatacgggc aaaacactgg 

<210> 350 
<211> 21 
<212> DNA 
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<213> Homo sapiens 



<400> 350 

gttgaatata gagcagggcc c 



21 



<210> 351 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 351 

ttctgaggtc agggctgtct 20 

<210> 352 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 352 

agcttggaaa atctcgtgtc a 21 

<210> 353 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 353 

actcagtccc tcccaccc 18 

<210> 354 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 354 

tcctctcact ccttcccaga 20 

<210> 355 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 355 

gtgatcacgg ctcaacctg 19 

<210> 356 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 356 

tggaggactg cttgagcc 18 
<210> 357 
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<211> 19 

<212> DMA. 

<213> Homo sapiens 

<400> 357 

ctgcagctgc ctcagtttc 

<210> 358 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 358 

tcaaaagtgc tggtgacagc 

<210> 359 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 359 

atttccagag ccagctcaaa 

<210> 360 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 360 

ctttaatgtt gtgatgacac aaagc 

<210> 361 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 361 

gatcatgcac tgttgaccac 

<210> 362 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 362 

tacatttgaa acatttaaaa cctga 

<210> 363 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 363 

aactgagctg taaccagact ggga 
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<210> 364 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 364 

tggaacagtc tggtcctgat gg 22 

<210> 365 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 365 

ttatcccttt attgtttctc ctttg 25 

<210> 366 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 366 

tggtcacctg tatttattgc tagg 24 

<210> 367 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 367 

tcttcaaagc ctctgcagta cc 22 

<210> 368 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 368 

ctcatctcca acctgtctaa cc 22 

<210> 369 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 369 

gtggctgcag ctaatgtaag acac 24 

<210> 370 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 370 
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<210> 371 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 371 

ctgattgaga accagaacag 

<210> 372 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 372 

taaagcccta taacctctcc 

<210> 373 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 373 

tagtaaggga ccttcaccag 

<210> 374 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 374 

agatgtttgg tatgacttgg 

<210> 375 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 375 

gatgattaaa ctctcctggc 

<210> 376 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 376 

gagacagcta agcactcatg 

<210> 377 

<211> 19 

<212> DNA 

<213> Homo sapiens 




PCT/US02/14877 

24 

20 
20 

20 

20 

20 
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<400> 377 

gaggtggtgg gcacctgta 



19 



<210> 378 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 378 

agaggggagg aacacacctt 2 0 

<210> 379 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 379 

gaccagagtc tgcccagaag 2 0 

<210> 380 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 380 

tccccagctc tatcccaac 19 

<210> 381 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 381 

99Tagggatgg acaagtctga 20 

<210> 382 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 382 

gtccagctcg ctgactatcc 2 0 

<210> 383 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 383 

tcaaaacaca gtcatctcca 20 

<210> 384 
<211> 20 
<212> DNA 
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<213> Homo sapiens 
<400> 384 

gcaaaggctt taccatattg 

<210> 385 
<211> 16 
<212> DNA 

<213> Homo sapiens 

<400> 385 
gctcagcacc cccatt 

<210> 386 
<211> 16 
<212> DNA 

<213> Homo sapiens 

<400> 386 
tccctgctcg ggaaac 

<210> 387 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 387 

gttctccaga gagacagcac 

<210> 388 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 388 

gagagcaaca ctattgccc 

<210> 389 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 389 

tatagacttc agccctgctg c 

<210> 390 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 390 

cctctgtagg atgcagttgg 
<210> 391 
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<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 391 

ttgctacgca ctcctctact 



20 



<210> 392 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 392 

gtgaaggcag gaaatgtgac 2 0 

<210> 393 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 393 

atcctagacc agaggagccc 20 

<210> 394 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 394 

ctccccctgg tccagttatt 20 

<210> 395 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 395 

aactttcatt tgccaaggga 2 0 

<210> 396 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 396 

agcagatctg ctcttgcgat 2 0 

<210> 397 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 397 

acagttgtca tcggtaggca 2 0 
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<210> 398 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 398 

aaaagtatga atgggatgga gc 

<210> 399 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 399 

gtgcaggtgg cgtttatttt 

<210> 400 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 400 

ccctatatct ccgtgtgctc c 

<210> 401 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 401 

gctctagtgg gaaacctcag g 

<210> 402 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 402 

gaattccagg ctcttgcttg 

<210> 403 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 403 

ggtttggtct caaaggcaaa 

<210> 404 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 404 
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ccagtacatg gtggtcacca 



20 



<210> 405 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 405 

gctgccttgg aatttctgtt 2 0 

<210> 406 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 406 

gtgctgtggt ggggaaag 18 

<210> 407 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 407 

attcaagctc atccagaccc 20 

<210> 408 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 408 

ggactggccc tttgaaactc 20 

<210> 409 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 409 

atattgaccg tgcacaaata eg 22 

<210> 410 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 410 

agacctggga aaagtggaga a 21 

<210> 411 
<211> 20 
<212> DNA 

<213> Homo sapiens 
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<400> 411 

attggcagtg gaaaatgctt 

<210> 412 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 412 

ttaatctttt gtcaacttcc tgatt 

<210> 413 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 413 

tctgtcctcc tttcaccgga age 

<210> 414 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 414 

ggataaagaa actccgctct gctggtaga 

<210> 415 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 415 

teagggectg tgttgccgca ctctg 

<210> 416 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 416 

agcgatgtaa agggtaccag tgccg 

<210> 417 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 417 

aggcatgeaa gcttctta 

<210> 418 
<211> 18 
<212> DNA 
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<213> Homo sapiens 



<400> 418 

ccgggaggag acatctat 



18 



<210> 419 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 419 

tggtaagcac agaaaatgc 19 

<210> 420 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 420 

aatggatggg ggattatt 18 

<210> 421 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 421 

ctggacgtta tgtctgcc 18 

<210> 422 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 422 

agaggcccag tcacagat 18 

<210> 423 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 423 

atcactctga actgccact 19 

<210> 424 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 424 

cccttctgtt tttctgtttt 20 
<210> 425 
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<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 425 

caagctttga aggaagag 

<210> 426 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 426 

taggacgtta agtgaggac 

<210> 427 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 427 

gctctgcagt gggtaaaa 

<210> 428 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 428 

actctccaag actgtgcg 

<210> 429 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 429 

ccctttctga ggcaagat 

<210> 430. 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 430 

gaccacctgg gagagaac 

<210> 431 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 431 

cgctatgagt cccatctg 



19 



18 



18 



18 



18 



18 
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<210> 432 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 432 

gat cage tgc aatgaagg 18 

<210> 433 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 433 

ttgagtacac ggggtgac 18 

<210> 434 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 434 

cgcaggactg aaagatga 18 

<210> 435 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 435 

acctgtctcc tctcctgg 18 

<210> 436 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 436 

tgettttett ctgtggga 18 

<210> 437 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 437 

atgaccagca agcattgt 18 

<210> 438 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 438 
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gtactgggat tacaggcg 

<210> 439 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 439 

gcagaaggtc ctttggat 

<210> 440 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 440 

tttgcaggat tcatgctt 

<210> 441 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 441 

cgacattctt ttctggagg 

<210> 442 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 442 

acctttgcat gttggtttt 

<210> 443 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 443 

gcacttttcc ttccttcc 

<210> 444 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 444 

tgctttgctt tcttctgg 

<210> 445 
<211> 19 
<212> DNA 

<213> Homo sapiens 
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<400> 445 

acagctccag agagaagga 



19 



<210> 446 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 446 

gcagtcactt gaaaccaga 19 

<210> 447 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 447 

aggcatcaag ctttcctt 18 

<210> 448 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 448 

ggtttagaga accgagcc 18 

<210> 449 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 449 

gtggtgctgc aagttacc 18 

<210> 450 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 450 

ggaatccctt tctttcca 18 

<210> 451 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 451 

gaccatttgt tacgcagc 18 

<210> 452 
<211> 19 
<212> DNA 
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<213> Homo sapiens 



<400> 452 

gatgggtgtg aatgaacaa 



19 



<210> 453 
<211> 19 
<212> DNA 

<213> Homo sapiens 



<400> 453 

ctcaagcttc tgttcatgc 



19 



<210> 454 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 454 

gctgtgagtg tcttggct 



18 



<210> 455 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 455 

tacagaaaac cgcagctc 



18 



<210> 456 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 456 

gccaccaaag gaaagatt 



18 



<210> 457 

<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 457 

aaaaggaggg aatcatgg 



18 



<210> 458 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 458 

tcacttagca ggaggcag 



18 



<210> 459 
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<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 459 

ctgagcatcc gatgagac 



18 



<210> 460 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 460 

gtgcaaaatg agcagctt 18 

<210> 461 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 461 

tctaacccct tactgggc 18 

<210> 462 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 462 

tcctcaaact gggaatga 18 

<210> 463 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 463 

tttacacagg accaggga 18 

<210> 464 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 464 

atctccccca ctcagaag 18 

<210> 465 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 465 

gtccacgggc tttattct 18 
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<210> 466 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 466 

tgagcataaa tttcattagc tg 22 

<210:> 467 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 467 

ggaagagcaa aataaatcca 2 0 

<210> 468 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 468 

ggtgcacaga attgttcat 19 

<210> 469 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 469 

agcacgctta tttcatgg 18 

<210> 470 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 470 

gtaacaccag cagggaca 18 

<210> 471 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 471 

tcctgctgca ttatggat 18 

<210> 472 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 472 



-167- 



BNSDOCID: <WO 



.02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



gggggtgaga agtaggaa 



18 



<210> 473 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 473 

atggggatta aatacggg 18 

<210> 474 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 474 

agctagcatt gggctctt 18 

<210> 475 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 475 

ctgaggagaa gaggctgg 18 

<210> 476 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 476 

cgccttacaa ggcaagta 18 

<210> 477 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 477 

aggatgcttg ctagggtt 18 

<210> 478 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 478 

cacaagtgtc tggaaggc 18 

<210> 479 

<211> 18 

<212> DNA 

<213> Homo sapiens 
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<400> 479 

ggtctcagga gcccttta 

<210> 480 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 480 

acatgccact cttctcacta a 

<210> 481 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 481 

acttaaccaa ggatgggg 

<210> 482 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 482 

caacccacga gcataaga 

<210> 483 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 483 

taggctctgc actcttgg 

<210> 484 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 484 

acccacggag tctctctc 

<210> 485 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 485 

taaaggcggt gaagtgag 

<210> 486 
<211> 18 
<212> DNA 
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<213> Homo sapiens 



<400> 486 

ctaccgctct cctaggct 



18 



<210> 487 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 487 

tggggccaga taattctt 18 

<210> 488 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 488 

ctggtgtttg gtggtgtt 18 

<210> 489 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 489 

aaggaagagg tcaccagg 18 

<210> 490 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 490 

cacaaattcc atttccca 18 

<210> 491 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 491 

tcaataggtg atccaacatt t 21 

<210> 492 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 492 

aaagtcccac aaagggtc 18 
<210> 493 
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<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 493 

gggtaggggg atcttttt 

<210> 494 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 494 

tgtggaacat tcattggc 

<210> 495 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 495 

gtcctgggaa agatggaa 

<210> 496 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 496 

tcaaagcgtc tcccataa 

<210> 497 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 497 

tctttcgctg tacttggc 

<210> 498 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 498 

tgggaggtca gagtgatg 

<210> 499 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 499 

ggacagtgta tgtgttggg 
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<210> 500 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 500 

aggcagctgt ttttgtga 18 

<210> 501 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 501 

cttcttgagt cccgtgtg 18 

<210> 502 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 502 

caaccgagaa tcctctagc 19 

<210> 503 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 503 

gctgggagag aatcacaa 18 

<210> 504 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 504 

gctttgcaga agagacca 18 

<210> 505 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 505 

acgctgtcag gtcacact 18 

<210> 506 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 506 
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ggaggatgct caggtgat 



18 



<210> 507 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 507 

tagggggatc tttttcca 



18 



<210> 508 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 508 

gagcaatttg aaaagcca 



18 



<210> 509 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 509 

atggtccagc tcctctgt 



18 



<210> 510 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 510 

atagagcacc ccatctcc 



18 



<210> 511 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 511 

aacattgctg ttagccca 



18 



<210> 512 
<211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 512 

gcaatcgaaa cagcattc 



18 



<210> 513 
<211> 18 
<212> DNA 

<213> Homo sapiens 



-173- 



BNSDOCID: <WO 02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



<400> 513 

atgagttggc agctgaag 



18 



<210> 514 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 514 

aatgaaggtc ttgcctcc 18 

<210> 515 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 515 

gaggagaaga tccacaagcg 2 0 

<210> 516 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 516 

tctctggggc atactgaacc 20 

<210> 517 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 517 

ctgagctttt ggcactgt 18 

<210> 518 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 518 

ctgctaggtg acagcagg 18 

<210> 519 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 519 

tgtatgagtc tggagggtgt 20 

<210> 520 
<211> 18 
<212> DNA 
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<213> Homo sapiens 
<400> 520 

acacctggct gaggaaat 

<210> 521 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 521 

gcaggggacg tgataata 

<210> 522 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 522 

ttttgcttcc taccatgc 

<210> 523 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 523 

aaaattgtga gcacctcc 

<210> 524 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 524 

tttatattta aagtggcttt gtt 

<210> 525 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 525 

gtgcaaagcc cacagtat 

<210> 526 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 526 

aggaaaatgc aagagcag 
<210> 527 
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<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 527 

ccactgaatt gcatactttg 



20 



<210> 528 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 528 

tctgggtcca gtctgcta 18 

<210> 529 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 529 

agattttggg gagtcagg 18 

<210> 530 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 530 

gcgctcaagc aattctc 17 

<210> 531 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 531 

caagccccaa agtagtca 18 

<210> 532 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 532 

gaatcatcca atccacga 18 

<210> 533 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 533 

agcctccagg tgactacc 18 
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<210> 534 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 534 

gaaggacatg gtcagcag 18 

<210> 535 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 535 

atgctttcag cattttcg 18 

<210> 536 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 536 

tgatccgtgg tagggtta 18 

<210> 537 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 537 

gtcggattgg tttcacaa 18 

<210> 538 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 538 

ttttatggga atttcagcc 19 

<210> 539 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 539 

tttggaaaag aacagaaatg t 21 

<210> 540 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 540 



-177- 



BNSDOCID: <WO. 



_02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



ggctagtctt 



tcctgaacc 



19 



<210> 541 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 541 

ccttaatgcc cctgattc 18 

<210> 542 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 542 

gcgtttacaa gctgagga 18 

<210> 543 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 543 

tcaagcttgc tttctcaa 18 

<210> 544 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 544 

gtagcccagc aagtgtct 18 

<210> 545 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 545 

cctggctgga gataggat 18 

<210> 546 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 546 

cttcccctct gcctatgt 18 

<210> 547 

<211> 18 

<212> DNA 

<213> Homo sapiens 
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<400> 547 

ggcacgtact tcctacca 

<210> 548 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 548 

ggtgcttctt acaggcaa 

<210> 549 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 549 
acccaggctg gtgtgt 

<210> 550 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 550 

actgagttaa ttatcactcc cct 

<210> 551 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 551 

gatgcatttt gcttcacc 

<210> 552 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 552 

tctgctttta gagctgttag c 

<210> 553 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 553 

tcaagcttca aagagcaga 

<210> 554 
<211> 18 
<212> DNA 
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<213> Homo sapiens 



<400> 554 

ggagtacatc ccaggacc 



18 



<210> 555 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 555 

tggtgctttt aaatccaga 19 

<210> 556 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 556 

ctcccttact tacttgcatt g 21 

<210> 557 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 557 

tcttctccca gggaatct 18 

<210> 558 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 558 

tttatgtccc ctgagcac 18 

<210> 559 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 559 

tccctggcta tcttgaatc 19 

<210> 560 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 560 

cttgactggg tccacg 16 

<210> 561 
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<211> 20 
<212> DNA 
<213> Homo 



sapiens 



<400> 561 
cgagacgcca 



gtagatacca 



20 



<210> 562 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 562 

catcctccat gcctttcagt 20 

<210> 563 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 563 

agttccagag aacgagacgc 2 0 

<210> 564 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 564 

cttgtcatcc tccatgcctt 20 

<210> 565 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 565 

gagcgtgaga ggttgaggag 2 0 

<210> 566 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 566 

aaacaaactc cagacgcacc 2 0 

<210> 567 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 567 

ctgaaccact acctgtatga cctg 2 4 
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<210> 568 
<211> 25 
<212> DNA 

<213* Homo sapiens 
<400> 568 

ctaactactt actcctacag ggccc 25 

<210> 569 
<211^ 23 
<212> DNA 

<213> Homo sapiens 
<400> 569 

gaagcatttc aatactttaa ctg 23 

<210> 570 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<210> 571 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 571 

cttctcctgg ccactctgac 2 0 

<210> 572 
<211> 21 
<212> DNA 

<213> Homo sapiens 



<210> 573 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 573 

tgaggatgaa tgagcacata gg 22 

<210> 574 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 574 



<400> 570 

ccactccagt gcacccaatc 



20 



<400> 572 

ggtttacctt tgaatcccag c 



21 
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tttgtggtcc attgagtagg c 

<210> 575 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 575 

aggggaagga atgtgcttgg 

<210> 576 

<211i> 20 

<212:> DNA 

<213> Homo sapiens 

<400> 576 

ttcggctgag cgggcagtgt 



<210> 577 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 577 

attgaaggtc ctccaaaaga atgctg 

<210> 578 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 578 

agaacgtcaa catatctttt tgggggacac 

<210> 579 

<211> 22 

<212> DNA 

<213> Homo sapiens 



26 



30 



<400> 579 

ttgtatttga ggactttgct eg 

<210> 580 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 580 

cggtaccatc ctcctcttcc 

<210> 581 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 581 

tttttgcctc atctatgccc 



20 



<210> 582 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 582 

gggtgacaga gcaagactcc 2 0 

<210> 583 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 583 

ttgctcaagt tctcctgg 18 

<210> 584 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 584 

accttgtttt gaggggag 18 

<210> 585 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 585 

cttggctatt tggacagc 18 

<210> 586 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 586 

gggcatttac tcacttgc 18 

<210> 587 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 587 

cttgtgtcag ttgtcaggg 19 

<210> 588 
<211> 19 
<212> DNA 
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<213> Homo sapiens 
<400> 588 

tggaattgtt gtgtcttgg 

<210> 589 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 589 

ccagttccac tggatgtt 

<210> 590 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 590 

atgggctgtg tttctcaa 

<210> 591 

<211> 18 

<212> DNA 

<213 > Homo sapiens 

<400> 591 

ctgcctatcc ctggactt 

<210> 592 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 592 

agtttgtccc tagtgccc 

<210> 593 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 593 

caacacgtct gacatccat 

<210> 594 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 594 
ggatagtgca caccca 

<210> 595 
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<211> 22 

<212> DNA 

<213> Homo sapiens 



<400> 595 

tgggtggtac tattgttccc at 



22 



<210> 596 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 596 

agttccagcc cccttaccag 20 

<210> 597 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 597 

ggccactatc atccctgtgt 2 0 

<210> 598 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 598 

tttcacatgg gaagaacacg 2 0 

<210> 599 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 599 

acagtgacac tagggacggg 2 0 

<210> 600 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 600 

tgccaggatg gagataacaa 2 0 

<210> 601 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 601 

cctgtggcac acatatcacc 20 
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<210> 602 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 602 

acaaccaaga atggagccac 2 0 

<210> 603 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 603 

tgctgtgtaa caagtcccca 2 0 

<210> 604 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 604 

tgaacggagg acctaccaag 20 

<210> 605 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 605 

gcagggtccg actcactaag 20 

<210> 606 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 606 

gctgtgagtt ccctttacgc 2 0 

<210> 607 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 607 

acagtgggga caaagacagg 2 0 

<210> 608 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 608 
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tacagggcac ctcccagtag 



20 



<210> 609 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 609 

tcttctgtta aggtttcccc c 21 

<210> 610 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 610 

tgtctcaaac ctccctctgc 20 

<210> 611 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 611 

aacatatttc ctccccagcc 20 

<210> 612 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 612 

cagtcccagc caatgagaa 19 

<210> 613 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 613 

ctcctctgca tgggagaatc 2 0 

<210> 614 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 614 

agacctggga ccagtctgtg 2 0 

<210> 615 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 615 

gggagacgac gtcacaagat 20 

<210> 616 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 616 

tgatgttggg aagatggtga 20 

<210> 617 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 617 

caggcatctt ctatgtgcca 2 0 

<210> 618 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 618 

gggaggcaca agttcfcttca 20 

<210> 619 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 619 

acttcgtggc actgagtgtg 20 

<210> 620 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 620 

cctttcttac ggatgaggca 20 

<210> 621 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 621 

ggctgctgag ctcttctgat 20 

<210> 622 
<211> 20 
<212> DNA 
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<213> Homo sapiens 



<400> 622 

tgggtctctc tgcctgactt 



20 



<210> 623 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 623 

tcacctactt ccagcttccg 

<210> 624 
<211> 20 
<212> DNA 

<213> Homo sapiens 



20 



<400> 624 

agacctggga ccagtctgtg 



20 



<210> 625 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 625 

ctcctctgca tgggagaatc 



20 



<210> 626 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 626 

aattcaggag acctgggacc 



20 



<210> 627 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a BstXI-linker adapter. 

<400> 627 
gtcttcacca cgggg 



15 



<210> 628 
<211> 11 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Artificial Sequence is a BstXI- linker adapter. 



<400> 628 
gtggtgaaga c 



11 



<210> 629 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 629 

ccaagttctg agaagtcc 18 

<210> 630 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Artificial Sequence is a primer. 
<400> 630 

aatacctgaa accatac 17 

<210> 631 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is an allele specific oligonucleotide. 
<400> 631 

agactggggt gagacgc 17 

<210> 632 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is an allele specific oligonucleotide. 
<400> 632 

cagactgggt tgagacgcc 19 

<210> 633 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificial Sequence is a primer. 
<400> 633 

cccgtgtgct ccgccgccca gttc 24 

<210> 634 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 634 

ggctcacgga gctcatcatg gactt 2 5 

<210> 635 
<211> 502 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 



<400> 635 

cccgtgtgct 

cgctgcgacg 

tgcctgccca 

gactccttcc 

ccctcagacg 

tctctcttcg 

gcgggggcca 

aatttcatag 

tccatgatga 



ccgccgccca 
gcgaggcaga 
accagttccg 
ccgactgtat 
acagcccggc 
tcatgggtgg 
acgggccctt 
ccccgggcgg 
gctccgtgag 



gttcccctgc 
ctgtcaggac 
gtgtgcgagc 
cgacggctcc 
ccacagcagt 
tgtctatttt 
cccgcacgag 
ttcccagcat 
cc 



gcgcggggtc 
cgctcagacg 
ggccagtgtg 
gacgagctca 
gccatcgggc 
gtgtgccagc 
tatgtcagcg 
ggccccttca 



agtgtgtgga 
aggtggactg 
tcctcatcaa 
tgtgtgaaat 
ccgtcattgg 
gcgtggtgtg 
ggaccccgca 
caggcatcgc 



cctgcgcctg 60 
tgacgccatc 120 
acagcagtgc 180 
caccaagccg 240 
catcatcctc 300 
ccagcgctat 360 
cgtgcccctc 420 
atgcggaaag 4 80 
502 



<210> 636 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 
<400> 636 

agcgaggcca ccatccacag g 

<210> 637 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 



21 
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<400> 637 

tcgctggtcg gcataatcaa t 21 

<210> 638 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a primer. 



<400> 638 

agcagagcca ccatccacag gatctccctg gagactaaca acaacgatgt ggctatccca 60 
ctcacgggtg tcaaagaggc ctctgcactg gactttgatg tgtccaacaa tcacatctac 12 0 
tggactgatg ttagcctcaa gacgatcagc cgagccttca tgaatgggag ctcagtggag 180 
cacgtgattg agtttggcct cgactaccct gaaggaatgg ctgtggactg gatgggcaag 240 
aacctctatt gggcggacac agggaccaac aggattgagg tggcccggct ggatgggcag 3 00 
ttccggcagg tgcttgtgtg gagagacctt gacaacccca ggtctctggc tctggatcct 3 60 
actaaaggct acatctactg gactgagtgg ggtggcaagc caaggattgt gcgggccttc 420 
atggatggga ccaattgtat gacactggta gacaaggtgg gccgggccaa cgacctcacc 480 
attgattatg ccgaccagcg a 501 

<210> 639 
<211> 26 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a Zmaxl oligonucleotide. 



<400> 639 

raguacagcu ucuugccaac ccaguc 26 

<210> 640 
<211> 26 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a Zmaxl oligonucleotide. 
<400> 640 

ruccuccagg ucgaugguca gcccau 26 

<210> 641 
<211> 26 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence is a Zmaxl oligonucleotide. 
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<400> 641 

rgucugaguc cgaguucaaa uccagg 



26 



<210> 642 

<211> 3081 

<212> DNA 

<213> Homo sapiens 

<400> 642 

cccgccagcc cagcccagcc caaccctact ccctccccac gccagggcag cagccgttgc 60 

tcagagagaa ggtggaggaa gaaatccaga ccctagcacg cgcgcaccat catggaccat 12 0 

tatgattctc agcaaaccaa cgattacatg cagccagaag aggactggga ccgggacctg 18 0 

ctcctggacc cggcctggga gaagcagcag agaaagacat tcacggcatg gtgtaactcc 240 

cacctccgga aggcggggac acagatcgag aacatcgaag aggacttccg ggatggcctg 300 

aagctcatgc tgctgctgga ggtcatctca ggtgaacgct tggccaagcc agagcgaggc 360 

aagatgagag tgcacaagat ctccaacgtc aacaaggccc tggatttcat agccagcaaa 42 0 

ggcgtcaaac tggtgtccat cggagccgaa gaaatcgtgg atgggaatgt gaagatgacc 480 

ctgggcatga tctggaccat catcctgcgc tttgccatcc aggacatctc cgtggaagag 54 0 

acttcagcca aggaagggct gctcctgtgg tgtcagagaa agacagcccc ttacaaaaat 600 

gtcaacatcc agaacttcca cataagctgg aaggatggcc tcggcttctg tgctttgatc 660 

caccgacacc ggcccgagct gattgactac gggaagctgc ggaaggatga tccactcaca 720 

aatctgaata cggcttttga cgtggcagag aagtacctgg acatccccaa gatgctggat 7 80 

gccgaagaca tcgttggaac tgcccgaccg gatgagaaag ccatcatgac ttacgtgtct 84 0 

agcttctacc acgccttctc tggagcccag aaggcggaga cagcagccaa tcgcatctgc 900 

aaggtgttgg ccgtcaacca ggagaacgag cagcttatgg aagactacga gaagctggcc 960 

agtgatctgt tggagtggat ccgccgcaca atcccgtggc tggagaaccg ggtgcccgag 102 0 

aacaccatgc atgccatgca acagaagctg gaggacttcc gggactaccg gcgcctgcac 108 0 

aagccgccca aggtgcagga gaagtgccag ctggagatca acttcaacac gctgcagacc 114 0 

aagctgcggc tcagcaaccg gcctgccttc atgccctctg agggcaggat ggtctcggac 12 00 

atcaacaatg cctggggctg cctggagcag gtggagaagg gctatgagga gtggttgctg 126 0 

aatgagat cc ggaggctgga gcgactggac cacctggcag agaagttccg gcagaaggcc 132 0 

tccatccacg aggcctggac tgacggcaaa gaggccatgc tgcgacagaa ggactatgag 13 80 

accgccaccc tctcggagat caaggccttg ctcaagaagc atgaggcctt cgagagtgac 1440 

ctggctgccc accaggaccg tgtggagcag attgccgcca tcgcacagga gctcaatgag 1500 

ctggactatt atgactcacc cagtgtcaac gcccgttgcc aaaagatctg tgaccagtgg 1560 

gacaatctgg gggccctaac tcagaagcga agggaagctc tggagcggac cgagaaactg 162 0 

ctggagacca ttgaccagct gtacttggag tatgccaagc gggctgcacc cttcaacaac 1680 

tggatggagg gggccatgga ggacctgcag gacaccttca ttgtgcacac cattgaggag 174 0 

atccagggac tgaccacagc ccatgagcag ttcaaggcca ccctccctga tgccgacaag 1800 

gagcgcctgg ccatcctggg catccacaat gaggtgtcca agattgtcca gacctaccac 1860 

gtcaatatgg cgggcaccaa cccctacaca accatcacgc ctcaggagat caatggcaaa 1920 

tgggaccacg tgcggcagct ggtgcctcgg agggaccaag ctctgacgga ggagcatgcc 198 0 

cgacagcagc acaatgagag gctacgcaag cagtttggag cccaggccaa tgtcatcggg 2 04 0 

ccctggatcc agaccaagat ggaggagatc gggaggatct ccattgagat gcatgggacc 2100 

ctggaggacc agctcagcca cctgcggcag tatgagaaga gcatcgtcaa ctacaagcca 2160 

aagattgatc agctggaggg cgaccaccag ctcatccagg aggcgctcat cttcgacaac 2220 

aagcacacca actacaccat ggagcacatc cgtgtgggct gggagcagct gctcaccacc 22 8 0 

atcgccagga ccatcaatga ggtagagaac cagatcctga cccgggatgc caagggcatc 234 0 

agccaggagc agatgaatga gttccgggcc tccttcaacc actttgaccg ggatcactcc 2400 

ggcacactgg gtcccgagga gttcaaagcc tgcctcatca gcttgggtta tgatattggc 2460 

aacgaccccc agggagaagc agaatttgcc cgcatcatga gcattgtgga ccccaaccgc 2520 

ctgggggtag tgacattcca ggccttcatt gacttcatgt cccgcgagac agccgacaca 2580 

gatacagcag accaagtcat ggcttccttc aagatcctgg ctggggacaa gaactacatt 2640 

accatggacg agctgcgccg cgagctgcca cccgaccagg ctgagtactg catcgcgcgg 2700 
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atggccccct 
acggcgctgt 
cgccgtgccc 
ttctccactg 
ctggacggtg 
gcgctgaatc 
ctaataaaat 



acaccggccc 
acggcgagag 
acagatgtga 
aaaaatcctc 
acgtcagcct 
gtatgctttt 
aaataaatac 



cgactccgtg 
tgacctctaa 
aatgaatgta 
tttctttggg 
gtacaggctc 
tccttttgat 
9 



ccaggtgctc 
tccaccccgc 
atctaataga 
gtttttcttt 
ccaggggtgg 
aaataaacaa 



tggactacat 


gtccttctcc 


2760 


ccggccgccc 


tcgtcttgtg 


2820 


agcctaatca 


gcccaccatg 


2880 


ctttcttttt 


tgattttgca 


2940 


cgtcaaatgc 


tattgaaatt 


3000 


tgtaaaaatg 


tttcaaaaac 


3060 






3081 



<210> 643 

<211> 1324 

<212> DNA 

< 2 1 3 > Homo s ap i ens 

<220> 

<221> misc_f eature 
<222> (1) . . . (1324) 
<223> n = A,T,C or G 



<400> 643 

ggccgcccgg 

ccccgccgcg 

ccagcaactc 

actgcaagct 

agagatgcag 

gcacaaacag 

cctctcccaa 

cgctcccgag 

gcaggccctg 

ggcggtcagc 

ctccaaggaa 

gtcggattag 

gcacggagag 

ccatagtatt 

ttcttcctac 

tgccctgaca 

ggagcaagac 

cgacagtctg 

tgcccctttc 

ccacctccac 

gtcctctccc 

gagcagggtg 

tact 



cgcccccagc 
attgacatga 
aaattcacca 
cagtaccaca 
cgtcactatg 
gctgagatcg 
gagcaccagc 
ctgaactcta 
gccctgccct 
gcaggcaccg 
gacaagaacg 
cagggggccg 
aaaggaatgt 
tatggtggcg 
cccattccgg 
tgtgaggcaa 
ctgcgaagcg 
ccctgggcca 
ccgtcctggg 
ctgccccact 
atcgcccaca 
agggtgggcg 



agnccgagcc 
tgtttccaca 
cctcggactc 
gcctcaagct 
tgatgtacta 
tcaaaaggct 
agcaggtctt 
tcatccgaca 
tgaccccact 
gcctcctctc 
ggcacgatgg 
ggacagggag 
ttagcacaag 
ccggcggggc 
cttccctcct 
gctaaggcct 
cagcagcccc 
cagccctctc 
caccccgagt 
caccacctct 
aaaggggggg 
acccaggatt 



ggggcgcaca 
aagcaggcat 
ctgcgaccgc 
cgaatgtgac 
cgagatgtcc 
gaacgggatt 
gggagccatt 
gcagctccaa 
acccgtgggg 
gctgtccgcg 
tgacacccac 
gttgggaggg 
acacagcgga 
cccagcccag 
cctcccctgc 
ggagggtcag 
ggcccttccc 
accctggtac 
ctcccccgac 
gctagttcca 
cacgagggac 
ccccctcccc 



gncggggcgc 


agcccgcgcc 


60 


tcgggctcct 


cgcacctacc 


120 


atcaaagacg 


aatttcagct 


180 


aagttggcca 


gtgagaagtc 


240 


tacggcttga 


acatcgagat 


300 


tgtgcccagg 


tcctgcccta 


360 


gagagggcca 


agcaggtcac 


420 


gcccaccagc 


tgtcccagct 


480 


ctgcagccgc 


cttcgctgcc 


540 


ctgggttccc 


aggcccacct 


600 


caggaggatg 


atggcgagaa 


660 


gggacagagg 


ggagacagag 


720 


gctcgggatt 


ggctaatctc 


780 


cttgcaggcc 


acctctagct 


840 


agcctggtta 


ggtggatacc 


900 


atgggagacc 


aggtcccaag 


960 


ccgttttgaa 


catgtgtaac 


1020 


tgcatgcacg 


caatgctagc 


1080 


cccgggtccc 


aggtatgctc 


1140 


gacacctcca 


cgcccacctg 


1200 


gagcttagct 


gagctgggag 


1260 


ttcccaaata 


aagatgaggg 


1320 






1324 



<210> 644 

<211> 2377 

<212> DNA 

<213> Homo sapiens 



<400> 644 

ggtgacaaag 

ttagagtctg 

gatgatggct 

gaagatgcag 

aatggtggtt 



agccaacaga 
aagttgttac 
ccagatccaa 
gagctggtga 
ttaaaccttc 



gacaatagga 
caatggtgaa 
ggatgaaaca 
aaataggaga 
tagacctcca 



gacttgtcaa 
actacatgtt 
agagtgagca 
gtcagtggga 
agaccttcac 



tttgtcttga 


tgggctacag 


60 


cagaaagtgc 


ttctcagaat 


120 


caaatggatc 


agatgaccct 


180 


ataattctcc 


atcactctca 


240 


gaccaccacc 


acccacccca 


300 
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cgtagaccag catctgtcaa tggttcacca tctgccactt ctgaaagtga tgggtctagt 360 

acaggctctc tgccgccgac aaatacaaat acaaatacat ctgaaggagc aacatctgga 420 

ttaataattc ctcttactat atctggaggc tcaggcccta ggccattaaa tcctgtaact 4 80 

caagctccct tgccacctgg ttgggagcag agagtggacc agcacgggcg agtttactat 540 

gtagatcatg ttgagaaaag aacaacatgg gatagaccag aacctctacc tcctggctgg 600 

gaacggcggg ttgacaacat gggacgtatt tattatgttg accatttcac aagaacaaca 660 

acgtggcaga ggccaacact ggaatccgtc cggaactatg aacaatggca gctacagcgt 720 

agtcagcttc aaggagcaat gcagcagttt aaccagagat tcatttatgg gaatcaagat 78 0 

ttatttgcta catcacaaag taaagaattt gatcctcttg gtccattgcc acctggatgg 84 0 

gagaagagaa cagacagcaa tggcagagta tatttcgtca accacaacac acgaattaca 90 0 

caatgggaag accccagaag tcaaggtcaa ttaaatgaaa agcccttacc tgaaggttgg 960 

gaaatgagat tcacagtgga tggaattcca tattttgtgg accacaatag aagaactacc 102 0 

acctatatag atccccgcac aggaaaatct gccctagaca atggacctca gatagcctat 1080 

gttcgggact tcaaagcaaa ggttcagtat ttccggttct ggtgtcagca actggccatg 114 0 

ccacagcaca taaagattac agtgacaaga aaaacattgt ttgaggattc ctttcaacag 12 0 0 

ataatgagct tcagtcccca agatctgcga agacgtttgt gggtgatttt tccaggagaa 12 60 

gaaggtttag attatggagg tgtagcaaga gaatggttct ttcttttgtc acatgaagtg 132 0 

ttgaacccaa tgtattgcct gtttgaatat gcagggaagg ataactactg cttgcagata 13 8 0 

aaccccgctt cttacatcaa tccagatcac ctgaaatatt ttcgttttat tggcagatfct 144 0 

attgccatgg ctctgttcca tgggaaattc atagacacgg gtttttcttt accattctat 1500 

aagcgtatct tgaacaaacc agttggactc aaggatttag aatctattga tccagaattt 156 0 

tacaattctc tcatctgggt taaggaaaac aatattgagg aatgtgattt ggaaatgtac 162 0 

ttctccgttg acaaagaaat tctaggtgaa attaagagtc atgatctgaa acctaatggt 168 0 

ggcaatattc ttgtaacaga agaaaataaa gaggaataca tcagaatggt agctgagtgg 1740 

aggttgtctc gaggtgttga agaacagaca caagctttct ttgaaggctt taatgaaatt 1800 

cttccccagc aatatttgca atactttgat gcaaaggaat tagaggtcct tttatgtgga 1860 

atgcaagaga ttgatttgaa tgactggcaa agacatgcca tctaccgtca ttatgcaagg 1920 

accagcaaac aaatcatgtg gttttggcag tttgttaaag aaattgataa tgagaagaga 198 0 

atgagacttc tgcagtttgt tactggaacc tgccgattgc cagtaggagg atttgctgat 2 04 0 

ctcatgggga gcaatggacc acagaaattc tgcattgaaa aagttgggaa agaaaattgg 2100 

ctacccagaa gtcatacctg ttttaatcgc ctggacctgc caccatacaa gagctatgag 2160 

caactgaagg aaaagctgtt gtttgccata gaagaaacag aaggatttgg acaagagtaa 222 0 

cttctgagaa cttgcaccat gaatgggcaa gaacttattt gcaatgtttg tccttctctg 22 80 

cctgttgcac atcttgtaaa attggacaat ggctctttag agagttatct gagtgtaagt 2340 

aaattaatgt tctcatttaa aaaaaaaaaa aaaaaaa 2 3 77 

<210> 645 
<211> 1295 
<212> DNA 

<213> Homo sapiens 
<400> 645 

gggccgccgc ccacccgggc cttgcctcta cctcagtcgt tgccccccga ttttcggctg 60 

gagcccacgg ccccggccct cagcccccgc tctagcttcg ccagtagctc ggccagcgac 120 

gcgagcaagc cgtccagccc ccggggcagc ctgctgctgg acggggcggg ggctggcgga 180 

gctggaggta gccggccctg cagcaatcgc accagcggca tcagcatggg ctacgaccag 24 0 

cgccacggga gccccttgcc agcggggccg tgcctgtttg gcccacccct ggccggagca 3 00 

ccggcaggct attctcccgg aggggtcccg tccgcctacc cggagctcca cgccgccctg 360 

gaccgattgt acgctcagcg gcccgcgggg ttcggctgcc aggaaagccg ccactcgtat 420 

cccccggccc tgggcagccc tggagctcta gccggggccc gagtgggagc ggcggggccc 480 

ttggagagac ggggggcgca acccggacga cactctgtga ccggctacgg ggactgcgcc 54 0 

gtgggcgccc ggtaccagga cgagctaaca gctttgcttc gcctgacggt gggcaccggt 600 

gggcgagaag ccggagcccg cggagaaccc tcggggattg agccgtcggg tctggaggag 660 

ccaccaggtc ctttcgttcc ggaggccgcc cgggcccgga tgcgggagcc agaggccagg 720 



-196- 



BNSDOCID: <WO 02092000A2J_> 



WO 02/092000 PCT/US02/1 4877 



gaggactact 
tgccaggccc 
actttgcgtt 
ctgttttcag 
gagaagatcc 
tgcaacaagt 
gtcaccgact 
ccctctgagg 
gagtgctacc 
ttccctctgg 



tcggcaccfcg 
tggacagcct 
gcaaggcttt 
ggtttcagga 
tacaagcaat 
gcctggatgg 
accacaaaaa 
gctgtgagga 
actgtgagga 
atgggcactt 



tatcaagtgc 
ctaccacacc 
ctacagtgtc 
ggcagctgag 
ggggaagtcc 
catcccctfcc 
ttatgctcct 
catcgtgagg 
ctgccggatg 
gctctgccat 



aacaaaggca 
cagtgctttg 
aatggctctg 
aaatgctgtg 
tatcatccag 
acagtggact 
aagtgtgcag 
gtgatatcca 
cagctgagtg 
ggttg 



tctatgggca 
tttgctgctc 
tgtactgtga 
tctgtggtca 
gctgtttccg 
tctccaacca 
cctgtggcca 
tggaccggga 
atgaggaagg 



gagcaatgcc 
ttgtgggcga 
ggaagattat 
cttgattttg 
atgcattgtt 
agtatactgt 
acccatcctc 
ttatcacttt 
ctgctgctgt 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1295 



<210> 646 

<211> 3411 

<212> DNA 

<213> Homo sapiens 



<400> 646 

gggcccgggg tcccgccacc accgcgcgcg ggacagattg attcactttg gagctgtaag 60 

tactgatgta ttagggtgca gcgctcattg ttcattgacg cagagtccca aaatgaatat 120 

ccaagagcag ggtttcccct tggacctcgg agcaagtttc accgaagatg ctccccgacc 180 

cccagtgcct ggtgaggagg gagaactggt gtccacagac ccgaggcccg ccagctacag 240 

tttctgctcc gggaaaggtg ttggcattaa aggtgagact tcgacggcca ctccgaggcg 3 00 

ctcggatctg gacctggggt atgagcctga gggcagtgcc tcccccaccc caccatactt 3 60 

gaagtgggct gagtcactgc attccctgct ggatgaccaa gatgggataa gcctgttcag 42 0 

gactttcctg aagcaggagg gctgtgccga cttgctggac ttctggtttg cctgcactgg 4 80 

cttcaggaag ctggagccct gtgactcgaa cgaggagaag aggctgaagc tggcgagagc 54 0 

catctaccga aagtacattc ttgataacaa tggcatcgtg tcccggcaga ccaagccagc 600 

caccaagagc ttcataaagg gctgcatcat gaagcagctg atcgatcctg ccatgtttga 660 

ccaggcccag accgaaatcc aggccactat ggaggaaaac acctatccct ccttccttaa 72 0 

gtctgatatt tatttggaat atacgaggac aggctcggag agccccaaag tctgtagtga 780 

ccagagctct gggtcaggga cagggaaggg catatctgga tacctgccga ccttaaatga 84 0 

agatgaggaa tggaagtgtg accaggacat ggacgaggac gatggcagag acgctgctcc 90 0 

ccccggaaga ctccctcaga agctgctcct ggagacagct gccccgaggg tctcctccag 960 

tagacggtac agcgaaggca gagagttcag gtatggatcc tggcgggagc cagtcaaccc 1020 

ctattatgtc aatgccggct atgccctggc cccagccacc agtgccaacg acagcgagca 1080 

gcagagcctg tccagcgatg cagacaccct gtccctcacg gacagcagcg tggatgggat 1140 

ccccccatac aggatccgta agcagcaccg cagggagatg caggagagcg cgcaggtcaa 12 00 

tgggcgggtg cccctacctc acattccccg cacgtaccgg gtgccgaagg aggtccgcgt 1260 

ggagcctcag aagttcgcgg aggagctcat ccaccgcctg gaggctgtgc agcgcacgcg 1320 

ggaggccgag gagaagctgg aggagcggct gaagcgcgtg cgcatggagg aggaaggtga 13 80 

ggacggcgat ccatcgtcag ggcccccagg gccgtgtcac aagctgcctc ccgcccccgc 144 0 

ttggcaccac ttcccgcccc gcttgtgttg gacatgggct tgtgccgggc tccgggatgc 1500 

acacgaggag aaccctgaga gcatcctgga cgagcacgta cagcgtgtgc tgaggacaac 1560 

tggccgccag tcgcctgggc ctggccatcg ctccccggac agtgggcacg tggccaagat 162 0 

gccagtggca ctggggggtg ccgcctcggg gcacgggaag cacgtaccca agtcaggggc 1680 

gaagctggac gcggccggcc tgcaccacca ccgacacgtc caccaccacg tccaccacag 1740 

cacagcccgg cccaaggagc aggtggaggc cgaggccacc cgcagggccc agagcagctt 18 00 

cgcctggggc ctggaaccac acagccatgg ggcaaggtcc cgaggctact cagagagtgt 1860 

tggcgctgcc cccaacgcca gtgatggcct cgcccacagt gggaaggtgg gcgttgcgtg 1920 

caaaagaaat gccaagaagg ctgagtcggg gaagagcgcc agcaccgagg tgccaggtgc 19 80 

ctcggaggat gcggagaaga accagaaaat catgcagtgg atcattgagg gggaaaagga 2 040 

gatcagcagg caccgcagga ccggccacgg gtcttcgggg acgaggaagc cacagcccca 2100 

tgagaactcc agaccyttgt cccttgagca cccctgggcc ggccctcagc tccggacctc 2160 

cgtgcagccc tcccacctct tcatccaaga ccccaccatg ccaccccacc cagctcccaa 2220 
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ccccctaacc cagctggagg aggcgcgccg acgtctggag gaggaagaaa agagagccag 22 80 

ccgagcaccc tccaagcaga ggtatgtgca ggaggttatg cggcggggac gcgcctgcgt 2 34 0 

caggccagcg tgcgcgccgg tgctgcacgt ggtaccagcc gtgtcggaca tggagctctc 2400 

cgagacagag acaagatcgc agaggaaggt gggcggcggg agtgcccagc cgtgtgacag 2460 

catcgttgtg gcgtactact tctgcgggga acccatcccc taccgcaccc tggtgagggg 2520 

ccgcgctgtc accctgggcc agttcaagga gctgctgacc aaaaagggca gctacagata 258 0 

ctacttcaag aaagtgagcg acgagtttga ctgtggggtg gtgtttgagg aggttcgaga 264 0 

ggacgaggcc gtcctgcccg tctttgagga gaagatcatc ggcaaagtgg agaaggtgga 27 00 

ctgataggct ggtgggctgg ccgctgtgcc aggcgaggcc cttggcgggc acgggtgtca 2 7 60 

cggccaggca gatgacctcg tactcaggag cccgatgggg aacagtgttg ggtgtaccac 2 82 0 

ccatccctgt ggtctacccg tgtctagagg caggtagggg gtccctccaa gtggtccaca 2880 

agcttctgtc ctgcccccaa ggaggcagcc tggaccactc ctcatagcaa tacttggagg 2940 

gcccagccca agtgaggcag ccgaggtccc tgctgccagc ttcaggtgac ccccccccat 3 000 

cccccggcac ctcccttggg cacgtgtgct gggatctact ttccctctgg gatttgccca 3060 

cgtacccagg tctggctggg gcccaggccc ggatgcagag gcctgcaggg cctctgtcaa 312 0 

ttgtacgcgc caccaagtgc cttcaacaca gcttgtctct tgcctgccac tgtgtgaatc 3180 

ggcgacggag cactgcacct gcctccagcc gccggctgtg cagtcctggg tcctcctttc 3240 

tgagggcccg tgtaaatatg tacatttctc aggctagggc cagcaggggc tgcccgagtc 33 0 0 

tgtttttcat gcgatgacac ttgtacaatt atcttttcaa aggtacttgg ataataatga 3360 

aataaaactg tttttgaacc tgaataaaaa aaaaaaaaaa aaaaaaaaaa a 3411 

<210> 647 
<211> 3140 
<212> DNA 

<213> Homo sapiens 
<400> 647 

ggctgcgagt acctccatgg tcccggtggc tgtgacggcg gcagtggcgc ctgtcctgtc 60 

cataaacagc gatttctcag atttgcggga aattaaaaag caactgctgc ttattgcggg 12 0 

ccttacccgg gagcggggcc tactacacag tagcaaatgg tcggcggagt tggctttctc 18 0 

tctccctgca ttgcctctgg ccgagctgca accgcctccg cctattacag aggaagatgc 240 

ccaggatatg gatgcctata ccctggccaa ggcctacttt gacgttaaag agtatgatcg 3 00 

ggcagcacat ttcctgcatg gctgcaatag caagaaagcc tattttctgt atatgtattc 3 60 

cagatatctg tctggagaaa aaaagaagga cgatgaaaca gttgatagct taggccccct 420 

ggaaaaagga caagtgaaaa atgaggcgct tagagaattg agagtggagc tcagcaaaaa 480 

acaccaagct cgagaacttg atggatttgg actttatctg tatggtgtgg tgcttcgaaa 54 0 

actggacttg gttaaagagg ccattgatgt gtttgtggaa gctactcatg ttttgccctt 600 

gcattgggga gcctggttag aactctgtaa cctgatcaca gacaaagaga tgctgaagtt 660 

cctgtctttg ccagacacct ggatgaaaga gttttttctg gctcatatat acacagagtt 720 

gcagttgata gaggaggccc tgcaaaagta tcagaatctc attgatgtgg gcttctctaa 7 80 

gagctcgtat attgtttccc aaattgcagt tgcctatcac aatatcagag atattgacaa 84 0 

agccctctcc atttttaatg agctaaggaa acaagaccct tacaggattg aaaatatgga 900 

cacattctcc aaccttcttt atgtcaggag catgaaatcg gagttgagtt atctggctca 960 

taacctctgt gagattgata aataccgtgt agaaacgtgc tgtgtaattg gcaattatta 102 0 

cagtttacgt tctcagcatg agaaagcagc cttatatttc cagagagccc tgaaattaaa 1080 

tcctcggtat cttggtgcct ggacactaat gggacatgag tacatggaga tgaagaacac 114 0 

gtctgctgct atccaggctt atagacatgc cattgaggtc aacaaacggg actacagagc 1200 

ttggtatggc ctcgggcaga cctatgaaat ccttaagatg ccattttact gcctttatta 1260 

ttatagacgg gcccaccagc ttcgacccaa tgattctcgc atgctggttg ctttaggaga 132 0 

atgttacgag aaactcaatc aactagtgga agccaaaaag tgttattgga gagcttacgc 13 8 0 

cgtgggagat gtggagaaaa tggctctggt gaaactggca aagcttcatg aacagttgac 144 0 

tgagtcagaa caggctgccc agtgttacat caaatatatc caagatatct attcctgtgg 1500 

ggaaatagta gaacacttgg aggaaagcac tgcctttcgc tatctggccc agtactattt 1560 

taagtgcaaa ctgtgggatg aagcttcaac ttgtgcacaa aagtgttgtg catttaatga 162 0 
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tacccgggaa 
gactcctacc 
tacccccaca 
tctcaagcca 
gcacgtctgt 
ggacagcttg 
tatcttgcca 
ggccttcttc 
ttagcacagg 
gaaagtctgc 
cagagcaatg 
tgtcagagat 
caatcttcca 
ttcaactaga 
tagcccttac 
aaggagatag 
gacaagaata 
tttttttccc 
aagggaatca 
ggaggtaata 
tctgagctca 
cctaacatat 
attcatttgt 
atgccttttc 
tttataattt 
ataaaaaaaa 



gaaggtaagg 
accgaggtgc 
cgcagagttt 
gcacattgtt 
tccaaggacc 
tataattacc 
gttagaagta 
cctgaatgct 
caatgctaca 
cactctaccc 
gatacaaatg 
taggtttaaa 
aaacagtttt 
gttgaataaa 
tgagggcaaa 
gattatagat 
aactcacttc 
tcccatctgc 
ttcttatggt 
tgctataatg 
gggtccagac 
gacacttcat 
tactaagcct 
aaggttccca 
gttttaatga 
aaaaaaaaaa 



ccttactccg 
ctgctccctt 
ctccactcaa 
agacccatct 
tcagctcttc 
ttcagaggcc 
cttctgtctc 
tttttttttt 
ggaacaaagt 
tgagctggca 
accttgctct 
tggaatagag 

gggggggtct 

agaagggcaa 
agatggctat 
aatttcacgg 
aagctggtct 
aacagaatac 
gctcttggag 
tctgctgtaa 
tttattctta 
cttatattaa 
tctcaggcaa 
tggtccagaa 
ttttgctaca 



gcaaatccta 
tttcctacct 
cttgtcttct 
taattaagcc 
ttgtttctac 
aactgacaga 
acttatgtcc 
ggcccccaag 
ttcaatgctg 
gtagggcact 
tggatttgct 
ctatccattt 
tctaaagctt 
aaataatctc 
attgttagct 
acatttggaa 
gttttaataa 
atttttttca 
agtttcaggc 
tatatttgca 
ttcccagaat 
ggaaggttta 
gctgtatact 
tgatgtttga 
tttggaattc 



cagcttcgga 
gcttcactct 
gtcacgccat 
ttacctccat 
agatggcaac 
atcctggcag 
aaagagtggc 
aaagtccctt 
ctgagagtga 
gagtaccctt 
gagcatgatc 
gttcttactc 
tcaaattgga 
acagagcttg 
atactcctac 
ataacattgg 
attttcaacg 
gcctttatct 
ctgtgcatgt 
cagtagatgc 
tttgtgttac 
gaatatctaa 
agttactggt 
ttcttaattt 
aataaaaaat 



accaaggcga 
ctgctaacaa 
agttggctac 
gtaaagaaca 
agctccatag 
gaacagacat 
tatagatctt 
ttatagcact 
aagaaaggag 
aggaagaagt 
cctattctga 
tctagggaga 
agtaacttta 
gaactgctga 
caaagcaagc 
tgattataca 
taattgtcta 
agatgaggta 
gtgtacagca 
tatggatcat 
gtttttacct 
tacgacttga 
ctccactgcc 
ttctgtccct 
gtgaacaata 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3140 



<210> 648 
<211> 3513 
<212> DNA 

<213> Homo sapiens 



<400> 648 

ccgtgtacca 

agctggagca 

cttcgctagg 

tggaagctct 

gtgtgaaggc 

cagaggtcct 

ggggccggga 

acaggacggc 

aactgaccct 

cccgggaaag 

tggaggaggc 

agctgtacca 

ctggctggga 

agctgactga 

ggctgtttca 

cagagctgga 

ggcaccctga 

gctcagaggc 

tggccctgga 

gcctgtgtgt 

gcttcgatcg 



ggtgctgcta 
gttgccttct 
aggaggcctg 
actacagaac 
tgtgccccag 
gatgcagcag 
gctgacatgg 
aatggacaag 
gcagagcaac 
cggactgcac 
tggggagccc 
ggttgcccag 
ggcggctgaa 
attctctagg 
gctcttcagg 
gcaggaacgc 
cttgcctcct 
catccgccag 
ccaaaagctt 
cagccaggtc 
gaatctgggg 



gtgggaagca 
cagagcctgc 
ccttactgcc 
tgccaggcag 
cccatggagc 
gtgctagact 
ctgaagcaag 
gctgacgagc 
cagcgaatac 
cagattgaag 
tcgctggaca 
gagcaggtca 
ctggaccccc 
gctttggccc 
gaggccttga 
ccgggggttg 
gcccacttcc 
gagtgccgct 
gaggcttcac 
cccgctgcac 
cagagtctca 



cgctgctgaa 
tgacccacat 
accaggcctg 
cttgtgccct 
ctggggaggt 
cgccatggct 
aggtcccaga 
tatatgaccg 
aggccctaga 
tgtggctgca 
tgctgctcca 
ggcaagggga 
ctggggcacg 
agcggtgcca 
cgtgggctga 
tgttgcagca 
gaaagatgtg 
gggcctgggc 
tgaagctacc 
ctgcccaccc 
gtgaacctgc 



ggaagtgcct 
cccaacggcg 
gctggatttc 
gctccagggg 
cggtcagctg 
ggcatggcta 
ggtgaccctg 
ggtggatgga 
gttggtccaa 
gcaggtgggc 
ggcccaaggc 
gaagtttctg 
ctttctggcc 
gcggctggcg 
ggaggggcag 
gctgcagctg 
ggctctggcc 
gcggtgccag 
accggtgggc 
tcccctgagg 
ctgccactgc 



tccgggctgc 
gggctgccca 
cgaaggcggc 
gccatcgaaa 
ctacagcaga 
caatgccagg 
agcccagact 
ttgctgcacc 
acactggagg 
tggccagcac 
tcttttcagg 
cagccgctga 
ctgcgagccc 
gatgctgaga 
cgagtgttgg 
cactggacca 
acggggctgg 
gacacctggc 
agcacagcta 
aaggcctaca 
caccatgcgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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ccactattgc tgcctgccgc agaccagagg ctggaggagg tgccctgccc caggcatccc 132 0 

ctactgtgcc tccaccaggc agctctgacc ccaggagcct caacaggcta cagctggtgc 138 0 

tggcagagat ggtggccacg gagcgggagt atgtccgggc tctagagtac actatggaga 144 0 

actatttccc cgagctggat cgccccgatg tgccccaggg cctccgcggt cagcgtgccc 150 0 

acctctttgg caacctggag aagctgcggg acttccactg ccacttcttc ctgcgtgagc 1560 

tggaggcttg cacccggcac ccaccacgag tggcctatgc cttcctgcgc catagggtgc 162 0 

agtttgggat gtacgcgctc tacagcaaga ataagcctcg ctccgatgcc ctgatgtcaa 1680 

gctatgggca caccttcttc aaggacaagc agcaagcact gggggaccac ctggacctgg 174 0 

cctcctacct gctaaagccc atccagcgca tgggcaagta cgcactgctg ctgcaggagc 1800 

tggcacgggc ctgcgggggc cccacgcagg agctcagtgc gctgcgggag gcccagagcc 1860 

ttgtgcactt ccagctgcgg cacggaaacg acctgctggc catggacgcc atccagggct 192 0 

gtgatgttaa cctcaaggaa caggggcagc tggtgcgaca ggatgagttt gtggtgcgca 198 0 

cfcgggcgcca caagtccgtg cgccgcatct tcctttttga ggagctgctg ctcttcagca 2040 

agcctcgcca tgggcccaca ggggttgaca catttgccta caagcgctcc ttcaagatgg 2100 

cagaccttgg tctcactgag tgctgtggga acagcaacct gcgcttcgag atctggttcc 2160 

gccgccgcaa ggccagggac acctttgtgc tgcaggcctc cagcctggct atcaagcagg 222 0 

cctggacagc tgacatctcc cacctgcttt ggaggcaggc cgtccacaac aaggaggtgc 22 8 0 

gcatggctga gatggtgtcc atgggtgtgg ggaacaaggc cttccgagac attgctccca 2 34 0 

gcgaggaagc catcaacgac cgcaccgtca actatgtcct gaagtgccga gaagttcgct 2400 

ctcgggcgtc cattgccgta gccccgtttg accatgacag cctctacctg ggggcctcga 2460 

actcccttcc tggagaccct gcctcttgct ctgttctggg gtccctcaac ctgcacctgt 2520 

acagagaccc agctcttctg ggtctccgct gtcccctgta tcccagcttc ctagaggaag 2580 

cagcactgga ggctgaggca gagctgggcg gccagccctc tttgactgct gaggactcag 264 0 

agatctcgtc ccaatgccca tcagccagtg gctccagtgg ctctgacagc agctgtgtgt 2 7 00 

cagggcaggc cctgggtagg ggcctggagg acttaccctg tgtctgagcc cgggactgga 2760 

cgagcagtag atccagcagc ctgcagctcc aaggaacatt gcctctctgg atctgctgtg 2820 

accagggtgt ggctgacacc tgggctacct ccaacctaca tgtgcaacgc tgttgactac 2880 

cctttctgat gtgtgtggcc attggactaa ctggcacggg gcctctctag ggaagtctgg 2940 

ttgtagagcc tgaataggct cctggcccca tgaccccttc tcctgtcccc agctcccatc 3 000 

ccagttgtgg gttaagaata ggctagagca gacattgggt gtttccatgc tgtaggctgg 3060 

tgggggacca tgtgcctcta ggcagtgact agggtgcccc cacccctcag gaagaacaca 312 0 

ggtgggctcc tagcagctga tccccaatgc ctggccttaa agccgagctc agttaccata 3180 

gggacaggtc cacctctact gggccctcat gcttgccttt cctggccccc aggcccagcc 3240 

cctttttact ggggcagttt cgttattttg acttgatgcc ttttgaataa ctttcaatag 33 00 

aattgtctaa aattatctta ctggttgtta ggcctttggt gtctcagaga aggagtctag 33 60 

gtctttgatg tgtgatttaa tcttttattt gtttataata aaaaatagac tgattfcgtaa 3420 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 34 80 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 3513 

<210> 649 

<211> 3597 

<212> DNA 

<213> Homo sapiens 

<400> 649 

catgccagtt acttcctcag gaaaatattt tcttgccttc ttctttcagt atggttttaa 60 

atttgggaac agtggataac ccaagtgtcc cacaggccaa ggtatattcc aatggcagca 120 

tgatccctgc acccaaagcc agcccctaaa gcctacccct tgtgcacccg cagcctggta 180 

agtgagcttg gctgcttgtg aggagctaca agtgaaagag aagttatttt aaataaatcc 240 

caaagtttga ggcagactgt ccaggactgt tcccaggaag aagcaggagt tacccacagg 3 00 

aaaagtctct gacctggtcc cctcaggccc agctacctgc gcccaccagc agtgaaggtt 3 60 

gatgtactgg cccagcatct ccacctcccc catgcaacca ggtccctggt accgtgtctc 420 

ccgttgcatg tctggcttct gcctgtgctc ctcctgccac gagcatcctc cctgtccctc 4 80 

ctcattccac cgtgtctctc ctgcacacat agcctctgtc ccagggcgat ttatccactt 540 
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gagtacagga 
taccccaccc 
gggtccccat 
atccttacca 
ccctctgtga 
ggacacagta 
aaggacaaac 
gtaggaaggt 
ctccttccag 

gggtctttcc 

ccaggcagct 
tggtcctcgt 
ctgggagacc 
gggccattta 
aaagtaagaa 
aaacctatga 
tgaaaaaaaa 
tcacatggct 
ctaccgcctc 
cttatagagt 
gctgggacac 
acctaaatga 
gcaagagaga 
tttattagag 
tacatactca 
aagcaaagtt 
attggcagag 
ctttattcca 
ctgccgaatc 
gatgggtcat 
ggtctatgca 
tatctttgat 
aaaacagagg 
accgaacagg 
cgaagatggg 
tagttagctt 
aagaatcttg 
ctcaccactc 
gttggctgct 
tgccactgcc 
cctgctgagt 
gagccgacgt 
ttcaaatgct 
tctttgagtc 
acccccttcc 
ccctttgtta 
tgtatgtgga 
tcctagtaaa 
aattggagcc 
gtatacctgt 
tatttgtaat 

<210> 650 
<211> 855 



gctgctcaga 
aaagctgcct 
ccgaggatcc 
ctgtttccag 
cctaaaaaga 
atgtgaagtc 
ctcccacctc 
gctctgcagg 
gaacagtgag 
atgactgagt 
ccgtgccctg 
ctacatcaag 
ttgaaccctc 
tttgggattt 
gggagccctg 
cttccaaggg 
aaaatgggtc 
gctgctattc 
ctgccacctc 
ggggagctga 
tctgcaaacc 
ccactcctct 
gttttttttt 
ggagtatcca 
ttaggatccc 
aaacaaaacg 
gtagtgaggt 
ttcccacccg 
ctacacatct 
gggaagaccc 
gctcttcttc 
cttttgactc 
atacagttaa 
actaaggaac 
atacacgtgc 
gttgacttgt 
tttccattcg 
cctaccttgg 
ttgagcagga 
ctcctcccca 
taccatagca 
gtggggatga 
agagaagggg 
acatagagaa 
agccccatcc 
gaacaaagca 
tgagatctct 
atattaaaaa 
cagttgtaac 
gtatgtaaat 
atagaataaa 



cctctcagcc 
tcctggctgt 
acaagcaatg 
agccgtgaac 
aaagctcaat 
aaatacttaa 
acagacagcc 
cacaaggccc 
cccagtgagc 
cacctccagg 
gcttctcccc 
ggcctcttcc 
agcctcttcc 
tggagttgtt 
ctatggatgt 
gatgaggaga 
aggacattcc 
ctggtgcttc 
accctacagg 
gacacagcct 
tctcctcatg 
catttggaaa 
ttttttacta 
agttttaaaa 
gattaaattc 
aaacaaaatc 
gattcaggta 
ctttgatgtc 
cttatcagct 
aagctttccc 
caacaatttc 
aagcacattt 
agagcaaccc 
actttcgtgc 
tcatctgtct 
taaaccttct 
gaaaccaacg 
cctggtgtcc 
acgaggcctc 
ctttagcatg 
accagcaact 
tgcaggggtt 
agaaacagga 
gccgttgaag 
cacttctgtt 
aagcatatct 
aagaagaggg 
aaaattaaaa 
ttggtaaagg 
ataaggcatt 
gtttattaaa 



cagccctctg 
aggagctccc 
acttcccaaa 
gatgctgtga 
ttccatctgt 
ccgagcaaag 
aagcagtgag 
agagaagccc 
agtcccagcc 
agctcacctg 
atgccccaaa 
cctcttcctc 
tttaaaaaaa 
tggtttttgt 
taagtccaaa 
ggttcagagg 
ctggatgagg 
cccacacttt 
cattctctat 
caggtaacac 
gtgctaaggg 
gtaatccact 
caaatttttg 
gtatatagaa 
cttgagtaga 
cttcatatac 
ggctgaaaat 
tatgcttggc 
ttcctcaaac 
tccgccagga 
tttatttgga 
tggaagggct 
aaaggacgct 
acagaagtca 
gtcctccttt 
gttcttaaaa 
attttgtgtt 
ttgagacata 
caggccctga 
taggggccag 
ccagggtacc 
ttggcccagc 
tggaagggtg 
gaggtagggc 
tcaaactaaa 
ttaaacaaca 
tcttctggtt 
agtttttaaa 
caagcttctg 
cctattttgc 
aaataagaaa 



tgactgcccc 
tcgtctagcc 
tgacctccac 
tggcccaggt 
cttctttccc 
ggccagtgtt 
ggaaagacag 
ctctccggga 
agctcttcaa 
acccccagag 
tcccccccag 
ccagctctca 
acaaaacaaa 
ttttatatct 
ttactcggtt 
acaggaggag 
acaatgctag 
tgacagatgg 
gtaggaaaca 
tgacacagct 
tggcatgctc 
gcagtaaaag 
ctcccccata 
ttttttggtt 
ctggtgccta 
aaaaagaact 
cctgggttgc 
tctctgggct 
tttaaggagg 
ttgcaaaagc 
attagaactt 
cccttacaaa 
taagaaaccg 
gccgcaatcc 
cctctccctc 
tgaaaagcta 
ttagaatgga 
cggtctttgc 

ggtgggaagg 

cccatctctt 
acaacagaca 
cagaggaccc 
gtttaaggaa 
aggttatctc 
gctcccacct 
gtgttaaaat 
ttgattttta 
aaggaaacct 
tacctttgtt 
agttcagaac 
aaaaaaaaaa 



agccccatcc 
aaggccctat 
tgcaagaaga 
ctcagcacca 
aggaccaagg 
gttatcagtc 
acagacatag 
acttcccctg 
ggccttcaag 
aagacctacc 
ccatccctcc 
ggacaggtga 
acaaaactgt 
taatagttcg 
agtgggagca 
cctcccccat 

gggtggcatc 

agtccttctc 
agagccttat 
cccgaatgag 
ttgacaggaa 
tttcagacat 
aaattatttt 
gtaagagaaa 
ccagaaagca 
ttctgtttgt 

gggagcctca 

gcccctggta 
ctctgtgagg 
aagtagactt 
cctttgttag 
agtagaattt 
agaccacttc 
aggcacagga 
cccgacgttc 
gcttacctca 
cagccctccc 
ttagtcgtgt 
aaggattgga 
ccagcagggt 
atggctcagc 
agagttgagc 
ctggcagggg 
tgttccagtc 
cgaacattga 
gagcctcaaa 
aagaagagta 
atgctattta 
ataattaatt 
aaaaaaaact 
aaaaaaa 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3597 
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<212> DNA 
<213> Homo 



sapiens 



<400> 650 

cgctcaatta 

ggatagcaac 

catgccacgt 

gcagggagga 

cgcacgcagt 

ttgcacacgg 

atttccatga 

tacaggaagt 

agtcgtctcc 

tggcacgcct 

tcccgcgccg 

cgctcggccg 

aagggggccg 

aggaagtcag 

aggatctctt 



tctactcgag tctagactcg aggcggccgc ccattgtgca ctaaagcagg 
ggcgtccctc ctccccgctc agctgcagcc cgcagtcctc acagtggtaa 
ggtagtctct gtccatggac accacacgga tggttgtctc gcagccctgt 
taggacgggc acaggaggcg cattttggtg caaaaaccgt gtgatagtct 
agatgttgtt ctccacgtcc acggtgaagg gaaccccgtc caggcactca 
agcaccggaa gcagcctggg tggtaggact tgcccagggc ctgcaggatc 
tgagatgtcc acacacgctg catttgtcgg ccgtctgctg gaacccggag 
cctcctggca gtacactttc tcacccacgt tgtagaacgc cttcccacgg 
cacacgagtc gcaggtgaag cagtcagtgt gataaagact ccccattgcc 
gctgggctcc gtagatgcca agcccacact tgatgcaaat gccgaagtag 
tgcgcgcctc gagcgcccgc tccagctccc gggtgagcgc ctccagccgc 
cgcttgggcc gccctcccgg ccagggggca gcgggagtgc aggcagcggg 
gccccgcagg ctccggggag cgagcgggcg cggggcaggc gccgggcggg 
cgtaggcttt agtgcacaat gggaattcgg atccggtcga cactagttct 
ttttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
855 



<210> 651 

<211> 3791 

<212> DNA 

<213> Homo sapiens 

<400> 651 

acagacggcg ggtgaacatg gcgtcctcga cttggtctga gacgtgatag gcctgccttc 60 

tggttgaaga tgtggcgagt gaaaaaactg agcctcagcc tgtcgccttc gccccagacg 12 0 

ggaaaaccat ctatgagaac tcctctccgt gaacttaccc tgcagcccgg tgccctcacc 180 

acctctggaa aaagatcccc cgcttgctcc tcgctgaccc catcactgtg caagctgggg 24 0 

ctgcaggaag gcagcaacaa ctcgtctcca gtggattttg taaataacaa gaggacagac 30 0 

ttatcttcag aacatttcag tcattcctca aagtggctag aaacttgtca gcatgaatca 360 

gatgagcagc ctctagatcc aattccccaa attagctcta ctcctaaaac gtctgaggaa 42 0 

gcagtagacc cactgggcaa ttatatggtt aaaaccatcg tccttgtacc atctccactg 480 

gggcagcaac aagacatgat atttgaggcc cgtttagata ccatggcaga gacaaacagc 540 

atatctttaa atggaccttt gagaacagac gatctggtga gagaggaggt ggcaccctgc 600 

atgggagaca ggttttcaga agttgctgct gtatctgaga aacctatctt tcaggaatct 660 

ccgtcccatc tcttagagga gtctccacca aatccctgtt ctgaacaact acattgctcc 72 0 

aaggaaagcc tgagcagtag aactgaggct gtgcgtgagg acttagtacc ttctgaaagt 780 

aacgccttct tgccttcctc tgttctctgg ctttcccctt caactgcctt ggcagcagat 840 

ttccgtgtca atcatgtgga cccagaggag gaaattgtag agcatggagc tatggaggaa 900 

agagaaatga ggtttcccac acatcctaag gagtctgaaa cagaagatca agcacttgtc 960 

tcaagtgtgg aagatattct gtccacatgc ctgacaccaa atctagtaga aatggaatcc 102 0 

caagaagctc caggcccagc agtagaagat gttggtagga ttcttggctc tgatacagag 1080 

tcttggatgt ccccactggc ctggctggaa aaaggtgtaa atacctccgt catgctggaa 1140 

aatctccgcc aaagcttatc ccttccctcg atgcttcggg atgctgcaat tggcactacc 1200 

cctttctcta cttgctcggt ggggacttgg tttactcctt cagcaccaca ggaaaagagt 1260 

acaaacacat cccagacagg cctggttggc accaagcaca gtacttctga gacagagcag 132 0 

ctcctgtgtg gccggcctcc agatctgact gccttgtctc gacatgactt ggaagataac 13 80 

ctgctgagct ctcttgtcat tgtggagttt ctctcccgcc agcttcggga ctggaagagc 1440 

cagctggctg tccctcaccc agaaacccag gacagtagca cacagactga cacatctcac 150 0 

agtgggataa ctaataaact tcagcatctt aaggagagcc atgagatggg acaggcccta 1560 

cagcaggcca gaaatgtcat gcaatcatgg gtgcttatct ctaaagagct gatatccttg 162 0 

cttcacctat ccctgttgca tttagaagaa gataagacta ctgtgaatca ggagtctcgg 1680 
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cgtgcagaaa 
cagagcctca 
aaggatgcgg 
cagctggaac 
acccaactgg 
acttctacct 
ttgctgagtc 
caggaacgtg 
tctgcccagt 
ctagcaacag 
gagctacaga 
tgccagctta 
ctaaaacaca 
gacctgaaag 
ggtcaggttg 
tgtgagaacc 
gataaccagg 
actgagcaac 
caagagaccc 
gacaggacct 
cctgtgccct 
cttcagcccg 
actgagcttc 
ctgcagcgaa 
gaagtccaga 
tacaagaatg 
atagacaaga 
cttcggcgtg 
ccaactgcca 
tggacaaact 
gttccagagc 
aaaaactaga 
tgaggggatg 
agaatttgtt 
ggcatagagc 
aaaaaaaaaa 



cattggtctg 
aagcagaaag 
cagagatagt 
aggacctagc 
tagggcttca 
tgcaacaaga 
ggtcccgaca 
atgtggcaat 
tagaggagtg 
atctccgggc 
gtcagcatac 
cccagagcaa 
tgcaggcaga 
agaccttgga 
agtgtcaatt 
tcaaggacac 
agcaagatct 
tacagagcct 
ttctgctgag 
tcctgggaag 
tgcttggaag 
cagagacccc 
agagtctttg 
aaatttgtga 
aggcaaaaga 
aaaaggagct 
gtggcgagct 
cggagacaga 
gcctatggcc 
gagagtgatg 
ccatagaaat 
tgacattgtt 
caaagaacta 
tcacctcttt 
caactgagat 
a 



ttgctgtttt 
ggaggaggca 
gttggaggct 
atccatgcgg 
tgccaagcaa 
ctggaggtcc 
actcacagag 
tgaggaaaag 
caaaggccaa 
tcagttgcag 
ccattgtgcc 
tgaggagcag 
actgcagcag 
gtttgcagac 
gaaaaccaca 
tgtagagaac 
ggagaaaaca 
gactctcttt 
tacagcctgt 
catcttgaca 
tgacaagagt 
aggcatggag 
ttccctgcta 
gctgcaagct 
agcagacata 
ccaggaagtg 
cataagcctt 
gaccaaagtg 
accaattgga 
ttcctggaga 
atcctagagg 
cagcatattt 
cagggattgc 
ttacctgcaa 
aaatgctatt 



gatttgctga 
aggcacagag 
ttctgtgcac 
gaattcagag 
gaagagctgg 
atgcaactgg 
aaactcacag 
caggaggttt 
acagaacaac 
attctggcca 
caggacctgg 
gctgctcaat 
caacaagctg 
caggagaatc 
ctggaagtgc 
ctaacggcta 
cggcagtact 
ctacagacaa 
cctcccaccc 
gcagtggcag 
gctttcaccc 
gagagcctgg 
caagagtcta 
aggctgcagg 
gagaagctga 
atacagcaga 
agagaggagg 
ctccaggagg 
tccaggagaa 
tgaaaaatga 
agaaccttcg 
ataagaccct 
tggaatttct 
taccccctta 
taaataaagt 



agaaattgag 
aggaaatggc 
acgccagcca 
gccttctgaa 
ttcagcagac 
attatacaac 
tcaagagcca 
ctagggtgct 
tggagttgga 
acatggacag 
ctatgaagga 
gcgtaaagga 
tcctggccaa 
aggttgctca 
tccgggagcg 
aactggccag 
ctcaaaagct 
aactaaagga 
aggaacaccc 
atgaagagcc 
gagtagcatc 
cagaaatgag 
aagaagaagc 
cccaggaaga 
accaggcctt 
atgagaagat 
tgacccacct 
cctggcaggc 
agtgtggctc 
gaaggaaaac 
gcgctctgac 
gctctctatt 
gagctaagaa 
ccccaatacc 
gtatttaatg 



ggcaaagctc 
tctcagaggc 
gcgcatcagc 
ggatgcccag 
agtgagtctt 
atggacagct 
gcaagccctg 
ggaacaagtc 
aaacattcgt 
ccagctaaaa 
tgagttactc 
agagatggca 
agaggtgcgg 
cctggagctg 
cagcttgcag 
caccatagca 
agggctgctg 
gaagactgaa 
tctgcctaat 
agaatcaact 
aatggtttcc 
tattatgact 
catcaggact 
acagcatcag 
gtgcttgcgc 
cctagaacag 
tacccgctca 
cagctggact 
tctcaggagg 
tcctgatcaa 
aaggagttag 
ccagaggtgg 
actgaaagcc 
aagaccaact 
aaaaaaaaaa 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3791 



<210> 652 

<211> 1683 

<212> DNA 

<213> Homo sapiens 



<400> 652 

ctctgagtgt 

cttgacctat 

gaggcagctg 

cctatcccga 

gggacagagt 

gctgctacct 

tgtccccctc 

gccagctcca 

ctgccaacag 

gcggccttct 

tggtccctgg 



ccagtggtca 
ttggctgttg 
aggccagagc 
agcctcccca 
caagtgcagc 
gcccaactcc 
caaaaagagc 
tttggggacc 
gaccggtccc 
ccagacaatc 
aacctaccac 



gttgccccag 
cttctgctgc 
actttcaaga 
tggatcaccc 
cccctccctc 
ctgctgaaaa 
tgccctctct 
agagccatcc 
aagggggctg 
tgaaccaaat 
agtccagcta 



gatggggacc 
ctctgaggga 
agttggctac 
tgactcctct 
tcaggaggcc 
ggaagtgggt 
ccagcacccc 
agaacctgag 
gggccaccgg 
ctgccttcct 
ctcccacctc 



acagccagag 
ggcttcacgg 
gcagctcccc 
cagcatggcc 
acccctctcc 
ccccctctcc 
aatgaacaga 
tcctggaatg 
ctggatggct 
aaccgtcagc 
actcgccagg 



cagccttggt 
ctacaggaca 
cctccccacc 
ctccctttga 
aacaggaaaa 
ctcaggaagc 
aggaaggaac 
cagcccagca 
tcccccctgg 
atgtggtata 
gtgagaccct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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caatttcctg gagattggat attcccgctg ctgccactgc cgcagccaca caaaccgcct 720 

agagtgtgcc aaacttgtgt gggaggaagc aatgagccga ttctgtgagg ccgagttctc 780 

ggtcaagacc cgaccccact ggtgctgcac gcggcagggg gaggctcggt tctcctgctt 84 0 

ccaggaggaa gctccccagc cacactacca gctccgggcc tgccccagcc atcagcctga 900 

tatttcctcg ggtcttgagc tgcctttccc tcctggggtg cccacattgg acaatatcaa 960 

gaacatctgc cacctgaggc gcttccgctc tgtgccacgc aacctgccag ctactgaccc 102 0 

cctacaaagg gagctgctgg cactgatcca gctggagagg gagttccagc gctgctgccg 10 80 

ccaggggaac aatcacacct gtacatggaa ggcctgggag gatacccttg acaaatactg 1140 

tgaccgggag tatgctgtga agacccacca ccacttgtgt tgccgccacc ctcccagccc 12 0 0 

tactcgggat gagtgctttg cccgtcgggc tccttacccc aactatgacc gggacatctt 12 60 

gaccattgac atcagtcgag tcacccccaa cctcatgggc cacctctgtg gaaaccaaag 132 0 

agttctcacc aagcataaac atattcctgg gctgatccac aacatgactg cccgctgctg 13 8 0 

tgacctgcca tttccagaac aggcctgctg tgcagaggag gagaaattaa ccttcatcaa 144 0 

tgatctgtgt ggtccccgac gtaacatctg gcgagacccfc gccctctgct gttacctgag 1500 

tcctggggat gaacaggtca actgcttcaa catcaattat ctgaggaacg tggctctagt 1560 

gtctggagac actgagaacg ccaagggcca gggggagcag ggctcaactg gaggaacaaa 162 0 

tatcagctcc acctctgagc ccaaggaaga atgagtcacc ccagagccct agagggtcag 1680 

atg 1683 



<210> 653 
<211> 1696 
<212> DNA 

<213> Homo sapiens 



<400> 653 

cacctaaaag ccaaaatggg aaaggaaaag actcatatca acattgtcgt cattggacac 6 0 

gtagattcgg gcaagtccac cactactggc catctgatct ataaatgcgg tggcatcgac 12 0 

aaaagaacca ttgaaaaatt tgagaaggag gctgctgaga tgggaaaggg ctccttcaag 180 

tatgcctggg tcttggataa actgaaagct gagcgtgaac gtggtatcac cattgatatc 240 

tccttgtgga aatttgagac cagcaagtac tatgtgacta tcattgatgc cccaggacac 3 00 

agagacttta tcaaaaacat gattacaggg acatctcagg ctgactgtgc tgtcctgatt 360 

gttgctgctg gtgttggtga atttgaagct ggtatctcca agaatgggca gacccgagag 42 0 

catgcccttc tggcttacac actgggtgtg aaacaactaa ttgtcggtgt taacaaaatg 4 80 

gattccactg agccacccta cagccagaag agatatgagg aaattgttaa ggaagtcagc 54 0 

acttacatta agaaaattgg ctacaacccc gacacagtag catttgtgcc aatttctggt 600 

tggaatggtg acaacatgct ggagccaagt gctaacatgc cttggttcaa gggatggaaa 660 

gtcacccgta aggatggcaa tgccagtgga accacgctgc ttgaggctgt ggactgcatc 720 

ctaccaccaa ctcgtccaac tgacaagccc ttgcgcctgc ctctccagga tgtctacaaa 7 80 

attggtggta ttggtactgt tcctgttggc cgagtggaga ctggtgttct caaacccggt 840 

atggtggtca cctttgctcc agtcaacgtt acaacggaag taaaatctgt cgaaatgcac 900 

catgaagctt tgagtgaagc tcttcctggg gacaatgtgg gcttcaatgt caagaatgtg 960 

tctgtcaagg atgttcgtcg tggcaacgtt gctggtgaca gcaaaaatga cccaccaatg 102 0 

gaagcagctg gcttcactgc tcaggtgatt atcctgaacc atccaggcca aataagcgcc 10 80 

ggctatgccc ctgtattgga ttgccacacg gctcacattg catgcaagtt tgctgagctg 1140 

aaggaaaaga ttgatcgccg ttctggtaaa aagctggaag atggccctaa attcttgaag 12 00 

tctggtgatg ctgccattgt tgatatggtt cctggcaagc ccatgtgtgt tgagagcttc 1260 

tcagactatc cacctttggg tcgctttgct gttcgtgata tgagacagac agttgcggtg 132 0 

ggtgtcatca aagcagtgga caagaaggct gctggagctg gcaaggtcac caagtctgcc 13 80 

cagaaagctc agaaggctaa atgaatatta tccctaatac ctgccacccc actcttaatc 1440 

agtggtggaa gaacggtctc agaactgttt gtttcaattg gccatttaag tttagtagta 1500 

aaagactggt taatgataac aatgcatcgt aaaaccttca gaaggaaagg agaatgtttt 1560 

gtggaccact ttggttttct tttttgcgtg tggcagtttt aagttattag tttttaaaat 1620 

cagtactttt taatggaaac aacttgacca aaaatttgtc acagaatttt gagacccatt 1680 

aaaaaagtta aatgag 1696 
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<210> 654 

<211> 7680 

<212> DNA 

<213> Homo sapiens 

<400> 654 

gaagagcaag aggcaggctc agcaaatggt tcagccccag tccccggtgg ctgtcagtca 60 

aagcaagccc ggttgttatg acaatggaaa acactatcag ataaatcaac agtgggagcg 12 0 

gacctaccta ggtaatgtgt tggtttgtac ttgttatgga ggaagccgag gttttaactg 180 

cgaaagtaaa cctgaagctg aagagacttg ctttgacaag tacactggga acacttaccg 24 0 

agtgggtgac acttatgagc gtcctaaaga ctccatgatc tgggactgta cctgcatcgg 3 00 

ggctgggcga gggagaataa gctgtaccat cgcaaaccgc tgccatgaag ggggtcagtc 3 60 

ctacaagatt ggtgacacct ggaggagacc acatgagact ggtggttaca tgttagagtg 420 

tgtgtgtctt ggtaatggaa aaggagaatg gacctgcaag cccatagctg agaagtgttt 4 80 

tgatcatgct gctgggactt cctatgtggt cggagaaacg tgggagaagc cctaccaagg 54 0 

ctggatgatg gtagattgta cttgcctggg agaaggcagc ggacgcatca cttgcacttc 600 

tagaaataga tgcaacgatc aggacacaag gacatcctat agaattggag acacctggag 660 

caagaaggat aatcgaggaa acctgctcca gtgcatctgc acaggcaacg gccgaggaga 720 

gtggaagtgt gagaggcaca cctctgtgca gaccacatcg agcggatctg gccccttcac 7 80 

cgatgttcgt gcagctgttt accaaccgca gcctcacccc cagcctcctc cctatggcca 840 

ctgtgtcaca gacagtggtg tggtctactc tgtggggatg cagtggttga agacacaagg 90 0 

aaataagcaa atgctttgca cgtgcctggg caacggagtc agctgccaag agacagctgt 960 

aacccagact tacggtggca acttaaatgg agagccatgt gtcttaccat tcacctacaa 1020 

tggcaggacg ttctactcct gcaccacgga agggcgacag gacggacatc tttggtgcag 1080 

cacaacttcg aattatgagc aggaccagaa atactctttc tgcacagacc acactgtttt 114 0 

ggttcagact caaggaggaa attccaatgg tgccttgtgc cacttcccct tcctatacaa 12 00 

caaccacaat tacactgatt gcacttctga gggcagaaga gacaacatga agtggtgtgg 1260 

gaccacacag aactatgatg ccgaccagaa gtttgggttc tgccccatgg ctgcccacga 13 2 0 

ggaaatctgc acaaccaatg aaggggtcat gtaccgcatt ggagatcagt gggataagca 13 8 0 

gcatgacatg ggtcacatga tgaggtgcac gtgtgttggg aatggtcgtg gggaatggac 1440 

atgcattgcc tactcgcaac ttcgagatca gtgcattgtt gatgacatca cttacaatgt 1500 

gaacgacaca ttccacaagc gtcatgaaga ggggcacatg ctgaactgta catgcttcgg 1560 

tcagggtcgg ggcaggtgga agtgtgatcc cgtcgaccaa tgccaggatt cagagactgg 162 0 

gacgttttat caaattggag attcatggga gaagtatgtg catggtgtca gataccagtg 1680 

ctactgctat ggccgtggca ttggggagtg gcattgccaa cctttacaga cctatccaag 174 0 

ctcaagtggt cctgtcgaag tatttatcac tgagactccg agtcagccca actcccaccc 1800 

catccagtgg aatgcaccac agccatctca catttccaag tacattctca ggtggagacc 1860 

taaaaattct gtaggccgtt ggaaggaagc taccatacca ggccacttaa actcctacac 192 0 

catcaaaggc ctgaagcctg gtgtggtata cgagggccag ctcatcagca tccagcagta 1980 

cggccaccaa gaagtgactc gctttgactt caccaccacc agcaccagca cacctgtgac 2 04 0 

cagcaacacc gtgacaggag agacgactcc cttttctcct cttgtggcca cttctgaatc 2100 

tgtgaccgaa atcacagcca gtagctttgt ggtctcctgg gtctcagctt ccgacaccgt 2160 

gtcgggattc cgggtggaat atgagctgag tgaggaggga gatgagccac agtacctgga 222 0 

tcttccaagc acagccactt ctgtgaacat ccctgacctg cttcctggcc gaaaatacat 22 80 

tgtaaatgtc tatcagatat ctgaggatgg ggagcagagt ttgatcctgt ctacttcaca 2340 

aacaacagcg cctgatgccc ctcctgaccc gactgtggac caagttgatg acacctcaat 2400 

tgttgttcgc tggagcagac cccaggctcc catcacaggg tacagaatag tctattcgcc 2460 

atcagtagaa ggtagcagca cagaactcaa ccttcctgaa actgcaaact ccgtcaccct 252 0 

cagtgacttg caacctggtg ttcagtataa catcactatc tatgctgtgg aagaaaatca 25 80 

agaaagtaca cctgttgtca ttcaacaaga aaccactggc accccacgct cagatacagt 2640 

gccctctccc agggacctgc agtttgtgga agtgacagac gtgaaggtca ccatcatgtg 2700 

gacaccgcct gagagtgcag tgaccggcta ccgtgtggat gtgatccccg tcaacctgcc 2760 

tggcgagcac gggcagaggc tgcccatcag caggaacacc tttgcagaag tcaccgggct 2 82 0 

gtcccctggg gtcacctatt acttcaaagt ctttgcagtg agccatggga gggagagcaa 2 880 
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gcctctgact gctcaacaga caaccaaact ggatgctccc actaacctcc agtttgtcaa 
tgaaactgat tctactgtcc tggtgagatg gactccacct cgggcccaga taacaggata 
ccgactgacc gtgggcctta cccgaagagg ccagcccagg cagtacaatg tgggtccctc 
tgtctccaag taccccctga ggaatctgca gcctgcatct gagtacaccg tatccctcgt 
ggccataaag ggcaaccaag agagccccaa agccactgga gtctttacca cactgcagcc 
tgggagctct attccacctt acaacaccga ggtgactgag accaccatcg tgatcacatg 
gacgcctgct ccaagaattg gttttaagct gggtgtacga ccaagccagg gaggagaggc 
accacgagaa gtgacttcag actcaggaag catcgttgtg tccggcttga ctccaggagt 
agaatacgtc tacaccatcc aagtcctgag agatggacag gaaagagatg cgccaattgt 
aaacaaagtg gtgacaccat tgtctccacc aacaaacttg catctggagg caaaccctga 
cactggagtg ctcacagtct cctgggagag gagcaccacc ccagacatta ctggttatag 
aattaccaca acccctacaa acggccagca gggaaattct ttggaagaag tggtccatgc 
tgatcagagc tcctgcactt ttgataacct gagtcccggc ctggagtaca atgtcagtgt 
ttacactgtc aaggatgaca aggaaagtgt ccctatctct gataccatca tcccagctgt 
tcctcctccc actgacctgc gattcaccaa cattggtcca gacaccatgc gtgtcacctg 
ggctccaccc ccatccattg atttaaccaa cttcctggtg cgttactcac ctgtgaaaaa 
tgaggaagat gttgcagagt tgtcaatttc tccttcagac aatgcagtgg tcttaacaaa 
tctcctgcct ggtacagaat atgtagtgag tgtctccagt gtctacgaac aacatgagag 
cacacctctt agaggaagac agaaaacagg tcttgattcc ccaactggca ttgacttttc 
tgatattact gccaactctt ttactgtgca ctggattgct cctcgagcca ccatcactgg 
ctacaggatc cgccatcatc ccgagcactt cagtgggaga cctcgagaag atcgggtgcc 
ccactctcgg aattccatca ccctcaccaa cctcactcca ggcacagagt atgtggtcag 
catcgttgct cttaatggca gagaggaaag tcccttattg attggccaac aatcaacagt 
ttctgatgtt ccgagggacc tggaagttgt tgctgcgacc cccaccagcc tactgatcag 
ctgggatgct cctgctgtca cagtgagata ttacaggatc acttacggag aaacaggagg 
aaatagccct gtccaggagt tcactgtgcc tgggagcaag tctacagcta ccatcagcgg 
ccttaaacct ggagttgatt ataccatcac tgtgtatgct gtcactggcc gtggagacag 
ccccgcaagc agcaagccaa tttccattaa ttaccgaaca gaaattgaca aaccatccca 
gatgcaagtg accgatgttc aggacaacag cattagtgtc aagtggctgc cttcaagttc 
ccctgttact ggttacagag taaccaccac tcccaaaaat ggaccaggac caacaaaaac 
taaaactgca ggtccagatc aaacagaaat gactattgaa ggcttgcagc ccacagtgga 
gtatgtggtt agtgtctatg ctcagaatcc aagcggagag agtcagcctc tggttcagac 
tgcagtaacc aacattgatc gccctaaagg actggcattc actgatgtgg atgtcgattc 
catcaaaatt gcttgggaaa gcccacaggg gcaagtttcc aggtacaggg tgacctactc 
gagccctgag gatggaatcc atgagctatt ccctgcacct gatggtgaag aagacactgc 
agagctgcaa ggcctcagac cgggttctga gtacacagtc agtgtggttg ccttgcacga 
tgatatggag agccagcccc tgattggaac ccagtccaca gctattcctg caccaactga 
cctgaagttc actcaggtca cacccacaag cctgagcgcc cagtggacac cacccaatgt 
tcagctcact ggatatcgag tgcgggtgac ccccaaggag aagaccggac caatgaaaga 
aatcaacctt gctcctgaca gctcatccgt ggttgtatca ggacttatgg tggccaccaa 
atatgaagtg agtgtctatg ctcttaagga cactttgaca agcagaccag ctcagggtgt 
tgtcaccact ctggagaatg tcagcccacc aagaagggct cgtgtgacag atgctactga 
gaccaccatc accattagct ggagaaccaa gactgagacg atcactggct tccaagttga 
tgccgttcca gccaatggcc agactccaat ccagagaacc atcaagccag atgtcagaag 
ctacaccatc acaggtttac aaccaggcac tgactacaag atctacctgt acaccttgaa 
tgacaatgct cggagctccc ctgtggtcat cgacgcctcc actgccattg atgcaccatc 
caacctgcgt ttcctggcca ccacacccaa ttccttgctg gtatcatggc agccgccacg 
tgccaggatt accggctaca tcatcaagta tgagaagcct gggtctcctc ccagagaagt 
ggtccctcgg ccccgccctg gtgtcacaga ggctactatt actggcctgg aaccgggaac 
cgaatataca atttatgtca ttgccctgaa gaataatcag aagagcgagc ccctgattgg 
aaggaaaaag acagacgagc ttccccaact ggtaaccctt ccacacccca atcttcatgg 
accagagatc ttggatgttc cttccacagt tcaaaagacc cctttcgtca cccaccctgg 
gtatgacact ggaaatggta ttcagcttcc tggcacttct ggtcagcaac ccagtgttgg 
gcaacaaatg atctttgagg aacatggttt taggcggacc acaccgccca caacggccac 
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ccccataagg 
gacaaccatc 
tgttggcact 
tctgacaggc 
gcagaggcat 
gaaccaacct 
agatgagtgg 
tggaagtggt 
caagattgga 
tcttgggaac 
tgggaagaca 
ctgcacatgc 
tgaacccagt 
tcagagaaca 
ggctgacaga 
ctctctgcca 
tttctttctt 
tctccaagca 
gcctgttctg 
tttggtttgg 
gatatgacca 
ctggcccgca 
cagtactcac 
ttatttatca 
atgcagttga 
gtgccaaagc 



cataggccaa 
tcatgggccc 
gatgaagaac 
ctcaccagag 
aaggttcggg 
acggatgact 
gaacgaatgt 
catttcagat 
gagaagtggg 
ggaaaaggag 
taccacgtag 
tttggaggcc 
cccgaaggca 
aacactaatg 
gaagattccc 
agatccatct 
aagccctttg 
tcaccctggg 
cttcgaagta 
gatcaatagg 
aaattttaag 
atactgtagg 
tttttccaaa 
atttttccca 
taagaggaat 
tttactactg 



gaccataccc 
cattccagga 
ccttacagtt 
gtgccaccta 
aagaggttgt 
cgtgctttga 
ctgaatcagg 
gtgattcatc 
accgtcaggg 
aattcaagtg 
gagaacagtg 
agcggggctg 
ctactggcca 
ttaattgccc 
gagagtaaat 
aaactggagt 
ctctggagga 
agtttcctga 
ttcaataccg 
aaagcatatg 
taggaaagtc 
aacaagcatg 
tgatcctagt 
gtatttttat 
ttggtataat 
tggaaagaca 



gccgaatgta 
cacttctgag 
cagggttcct 
caacatcata 
taccgtgggc 
cccctacaca 
ctttaaactg 
tagatggtgc 
agaaaatggc 
tgaccctcat 
gcagaaggaa 
gcgctgtgac 
gtcctacaac 
aattgagtgc 
catctttcca 
gatgttagca 
agttctccag 
gggttttctc 
ctcagtattt 
cagccaacca 
acccaaacac 
atcttgttac 
aattgcctag 
acggaaaaaa 
tatggtgggt 
actgttttaa 



ggacaagaag 
tacatcattt 
ggaacttcta 
gtggaggcac 
aactctgtca 
gtttcccatt 
ttgtgccagt 
catgacaatg 
cagatgatga 
gaggcaacgt 
tatctcggtg 
aactgccgca 
cagtattctc 
ttcatgcctt 
atccagagga 
gacccagctt 
cttcagctca 
ataaatgagg 
taaatgaagt 
agatgcaaat 
ttctgctttc 
tgtgatattt 
aaatatcttt 
ttgtattgaa 
gattattttt 
taaaagattt 



ctctctctca 
catgtcatcc 
ccagtgccac 
tgaaagacca 
acgaaggctt 
atgccgttgg 
gcttaggctt 
gtgtgaacta 
gctgcacatg 
gttacgatga 
ccatttgctc 
gacctggggg 
agagatacca 
tagatgtaca 
acaagcatgt 
agagttcttc 
actcacagct 
gctgcacatt 
gattctaaga 
gttttgaaat 
acttaagtgt 
taaatatcca 
ctcttacctg 
aacacttagt 
tatactgtat 
acattccaca 



6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 



<210> 655 

<211> 1316 

<212> DNA 

<213> Homo sapiens 



<400> 655 

tcctaatacg 

ttgcgagctg 

gctgggtgcc 

agcccggcaa 

gaggggggcg 

tgatgcctgc 

aatgccaccc 

actttggagg 

aggcagccac 

gtcgccccac 

gttacctgga 

ccctgttgcc 

tgtgttgcca 

ccagcgggca 

cgtacagcaa 

ccaaggacaa 

tctggtttca 

ctacccctta 

agaccagaaa 

acaacaccct 



actcactata 
ggagcgattt 
ccctagattc 
ttatgccacc 
gaatctggtc 
tgtcaactat 
atgccctggg 
cgggtactac 
cctggccgcg 
tgagtttgcc 
cgtgtctgtg 
tgtggacagt 
gggagaacag 
gcaccctcct 
ggggcagttg 
gaggcgcaag 
gaaccgccgg 
agagatctcc 
cctgccaagc 
tcccaggcca 



gggctcgagc 
aaaacgcttt 
cccgcccccg 
ttggatggag 
gcccactccc 
gcccccttgg 
gtgccccagg 
tcctgccgag 
taccccgcgg 
ttctatccgg 
gtgcagactc 
taccagtctt 
aacccaccag 
gacgcctgcg 
cgggagctgg 
atctcggcag 
gtcaaagaga 
ttgcctgggt 
ccaggctggg 
ctggctgctg 



ggccgcccgg 
ggattccccc 
cacctcatga 
ccaaggatat 
ctctgaccag 
atctgccagg 
ggacgtcccc 
tgtcccggag 
agactcccac 
gatatccggg 
tgggtgctcc 
gggctctcgc 
gtcccttttg 
cctttcgtcg 
agcgggagta 
ccaccagcct 
agaaggttct 
gggaggagcg 
gccaaggact 
gactgttcct 



gcaggtcgaa 
ggcctgggtg 
gccgaccctc 
cgaaggcttg 
ccacccagcg 
ctcggcggag 
agctcccgtg 
ctcgctgaaa 
ggccggggaa 
aacctaccac 
tggagaaccg 
tggtggctgg 
gaaggcagca 
cggccgcaag 
tgcggctaac 
ctcggagcgc 
cgccaaggtg 
aaagtggggg 
ctgctgagag 
caggagcggc 



tgcaggcgac 
gggagagcga 
ggctccatgg 
ctgggagcgg 
gcgcctacgc 
ccgccaaagc 
ccttatggtt 
ccctgtgccc 
gagtacccca 
gctatggcca 
cgacatgact 
aacagccaga 
tttgcagact 
aaacgcattc 
aagttcatca 
cagattacca 
aagaacagcg 
tgtcctgggg 
gcccctagag 
ctgggtaccc 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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agtatgtgca gggagacgga accccatgtg acaggcccac tccaccaggg ttcccaaaga 
acctggccca gtcataatca ttcatcctca cagtggcaat aatcacgata accagt 

<210> 656 
<211> 566 
<212> DNA 
<213> Homo sapiens 

<400> 656 

cccaccaaac ccataaagag ggtgggtcga cccacgcgtc cgcggacgcg tgggaaatta 

ttgaattgga aacagaaata gaaaagttta aagctgagaa cgcatcttta gctaaacttc 

gcattgaacg agaaagtgcc ttggaaaaac tcaggaaaga aattgcagac ttcgaacaac 

agaaagcaaa agaattagct cgaatagaag agtttaaaaa ggaggagatg aggaagctac 

aaaaggaacg taaagttttt gaaaagtata ctacagctgc aagaactttt ccagataaaa 

aggaacgtga agaaatacag actttaaaac agcaaatagc agatttacgg gaagatttga 

aaagaaagga gaccaaatgg tcaagtacac acagccgtct cagaagccag atacaaatgt 

tagtcagaga gaacacagac ctccgggaag aaataaaagt gatggaaaga ttccgactgg 

atgcctggaa gagagcagaa gccatagaga gcagcctcga ggtggagaag aaggacaagc 

ttgcgaacac atctgttcga tttcaa 

<210> 657 

<211> 5067 

<212> DNA 

<213> Homo sapiens 

<400> 657 

gcccggacac ctgtctgcag catggataag tatgacgacc tgggcctgga ggccagtaaa 
ttcatcgagg acctgaacat gtatgaggcc tctaaggatg ggctcttccg agtggacaag 
ggtgcaggca acaaccccga gtttgaggaa actcgcaggg tgttcgccac caagatggcc 
aaaatccacc tccagcagca gcagcagcag ctcctgcagg aggagactct gcccaggggg 
agtagaggcc ctgtcaatgg agggggccgc ctgggcccac aggcccgttg ggaagfctgtg 
ggcagcaagc tgactgtgga tggtgctgcc aagcctcctc ttgctgcctc gacaggggca 
cctggggcag tcaccaccct cgctgctggg cagcccccgt acccaccgca ggagcagaga 
tccaggccat acctgcatgg cacgaggcat ggcagccagg actgtggttc cagggagagc 
ctggcgactt ctgagatgtc tgctttccac cagccaggcc cctgtgagga tccttcctgc 
ctcactcatg gagactatta tgacaacctc tccttggcaa gcccaaagtg gggtgacaaa 
ccaggagtgt cccccagcat cggcctgagt gtagggagtg ggtggcctag ctccccgggg 
agtgacccac cactgcccaa accctgcggg gaccatcccc taaatcaccg acagctctcc 
ctgagctcca gcaggtcttc tgagggtagc ctcggtggtc agaatagtgg cattggtggc 
cgcagcagcg agaagccaac aggcctttgg tccactgcct cctcccagcg ggtgagccct 
ggcctgcctt ccccaaactt ggagaacgga gcaccagctg tggggcctgt tcagcccagg 
actccttctg tgtcagcacc cttggccctg agctgcccca ggcaaggagg tcttccaaga 
tcaaactcgg ggctgggggg tgaggtttca ggtgtgatgt ccaaacccaa tgtggacccc 
caaccctggt tccaggatgg gcccaaatct tacctttcca gttctgcccc gtcatcctcg 
ccagctggcc tggacggttc acagcagggt gcggtccctg ggctggggcc gaagcctggc 
tgcacagacc ttggcactgg tcccaagctc agccccacca gtcttgtcca tccagtgatg 
tccaccctgc ctgagttatc ttgtaaagag ggtcccctgg gctggtcttc tgatggtagc 
ctgggatctg tgctcctgga cagccccagc tcccctaggg taaggctgcc ctgccagccc 
ctcgtcccag gtcctgagct gagaccctct gctgctgagt tgaaattaga agccctcacc 
caacgtctgg agcgagagat ggatgctcac ccgaaggctg attactttgg agcctgtgtg 
aaatgcagca aaggggtgtt tggggctggc caggcctgtc aggccatggg gaacctctac 
catgacacat gcttcacctg tgcagcttgc agccggaagc tgagaggaaa agccttttat 
tttgtcaacg gcaaagtgtt ttgtgaagaa gacttcctgt actctggttt ccagcagtcg 
gctgacaggt gttttctttg tggacatctg atcatggaca tgatcctgca agccctgggg 



1260 
1316 
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aagtcctacc accccggctg tttccgctgt gtcatctgta atgagtgttt ggatggggtg 1740 

cccttcaccg tggactcaga gaacaagatc tactgtgtcc gagattacca caaggtgctg 18 00 

gcccccaagt gtgcagcctg tgggcttccc atccttccac ctgagggctc agatgagacc I860 

atccgtgtcg tgtccatgga cagagactac cacgtggagt gttaccactg cgaggactgt 1920 

ggtctggagc tcaatgatga agatggccac cgctgttatc cgctggagga ccacctgttc 1980 

tgtcactcct gccacgtgaa gaggctggag aagagaccct catctacagc ccttcaccag 2040 

caccacttct agccagagcc acttgcagac atcacggcag gggatgagga gccggggttg 210 0 

ctgctgctgc ttccggtggc ccctggggtg gaagtggggt aggggaagag gaggggcagg 2160 

agggagagtt cctgtgagca tgtggggggt gcctttcctt taaccaggga ggtgaacact 2220 

acctgcctcc tgcgtgtatt ttccaagtgc ttttctctgt tgccacattt tcctcaggtt 22 80 

actcaggaaa atgctccagc atgtgcgagc acatgacctg aggttgcatc atagcaccaa 2340 

aggaatcctc ctgtcccctc tgggaacatt tcatgcttca gagggagagg tttttattga 2400 

gcttgtttca caatatcccc ttgaagggac agctcagctg ccaatacatt caaccctttc 2460 

tcttccttca ggaaaatacc tatacccaaa tgttccctcc cccgacatat atcatggcat 2 52 0 

gacttaaggc ttcttttcac ctgagagctt cagttcttct gcagaatggc tgcaaattta 2580 

attgcattaa ggcaagaagg aagctctaat gtgtgctttg tatcctaaga taaatttgct 2 640 

tagaaaacca gagtcaagat ttgaaatagg tgaggcaggg tttcctcctt agacactgac 2700 

agcattctcc gtaccccttc aaatccttac tctcctaaag gcagctgagt ccgcgacaga 2760 

aatttgccct atgggagtaa aacatacttt gggagaagaa cttggtgcag gcaccaggat 2820 

tttttttttt gcccacgtgt ttgcgctgtt tttctctgga gttctcaaga gttggtgact 288 0 

tggaaggccg cttctgcaag gcaagtctca ggaacccatg caggtacatc gcttgcacct 2 94 0 

gtttttagct tatttaatga cgggcttttg ggaagagctg cccgcatact gagagacagc 3 00 0 

ttcttataaa caaggagagt ttttgtgtgt gcgagatctc taagccagcg tgggagggag 3 060 

cgcctcagga taagttatta tattcatttc gttggtttct ctcctgccca attcttggca 312 0 

caggcattat gtttgaagaa accaggataa ggtacactgc ttttgtctgt ttaatttttt 3180 

tagttgtttc ccttcacttt cagtcttcca cacacaaaaa atacctcaca gagcttcacc 3240 

aaatcacaga ttcaggagga atttggcttt cacactggac tcagatacct tcttcagtgt 33 0 0 

gttggaaatc actggcttca cacaggccca actccaactg gtcagggcag agtgatcgta 33 60 

actaaaggtc agtggggaat agatccgatt cagtgctttt gccttatgca tttcagcatc 342 0 

ctggctcccc agggtggcag gagctgagga agggccacac actggcaaga tttcaagacc 3 480 

actctctgca ctgaagaggt aaaatttgca ctgcaagtca catccctgag gccagaggtc 354 0 

agtacccttt ggtatttcga ttagaagaag ctgcaaaaga aaggcagccc attttaccat 3600 

tgccagccag gccggggaca caggagccgg tgtgtgcact ctgcctccta acattgcacc 3660 

cagagcaaga ggactgggtg ctgggctgca gaggccggtc agtggagccc ctagcacgtg 3 72 0 

tgaactcagg cttttcattg ggcccggctc cacttctagg ccatgttttg actcatttgg 3780 

taaccattgc ctgtaagcag cacagaattg gtgccatgga ttatcttttc catgttgatg 3 84 0 

gaattcattc tgttggaatc ctttggccag atgtcacttc agccagggtg tgcatcatca 3900 

ttggttcttt ttcacaggct gagcctcctg aaaacccatg aacgctgggg ctggggaagt 3 960 

gaaccctgag gtggggaccc tctcttccca tcaaatcatc cagctcagtg tggggcgtgg 4 020 

caggggggta aatgaagcca gccaatgtgt taacctgtct ctgtcaacct aagaatgttg 40 80 

gccttactga cacacctttg ctccatgttc aagaccagaa gtagctggga tttgtttgca 4140 

aattgggtaa ttagtttaaa aatctgtgat tacattttta aatgaaattt tcaaagtggc 420 0 

ctagattgag gtgattcaga taggtttgcg aatataccat tttatattgt tgagaaagaa 4260 

caaaaaggga atttccagat gtcctagaaa tcctagcaac agatttctct ggttgtcagt 432 0 

ttccctggag aaggcgccag ataggaatct ccaatcagtt gtttttctct tcgcttcagg 4380 

cccttacaca aaagccatga agagatgttc acctacccgg tattttaaat gttctgtaaa 4440 

ttattagcca aatagaactg taatggggtt gtatttatgg gcgcctagaa agaaaacaca 4500 

aggacttggt aggccaggaa gaaaagattt taaaatttag aatgaatagc ccttctgggt 4560 

tttctttttg acaattcttg gacttgaggt aaaacaagga ggattgtggc cggatttcag 462 0 

atcccaaagc cagcctccat cttaggcctt tgcctcattg tgccttttag gttttcttac 4680 

ccaccgtctc ctgttttgtc ttttttttct tttctcctac ccctatcttg ggacattcag 474 0 

aaactgcctg ggtggtttga gaagagacaa cccagtttga tctgcaatac aaggatccat 4800 

tcgtaatctc tctctcactg atgttattcc cccatctgcc gtcttggttc atctcaccac 4 860 

agaagggcat ttagtcctac ccagccatcg gctgcggtat gacagcagga tggcacttcc 4 920 
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catttctctg tggttagtgc 
gctgttgagt cctgggtggc 
gtcatcgcct tgtttccatg 



tcgagtgaaa acctctttca gctgagtcct 
tgatggaatg attgaggagg tctggtcacc 
ggcttct 



ctgaggttct 
ctcaagcgcc 



4980 
5040 
5067 



<210> 658 
<211> 950 
<212> DNA 
<213> Homo sapiens 

<400> 658 

tcgaccggat ccgaattccc attgtgcact aaagcgtctc cctgctccgc ggcccgggct 60 

ggcgggcggg cgctcggctg gcggctgcag cagcagaggg agacccgcgg caaccccggc 120 

aacccagggc tcggcgtcgc tgccaccatg acgggaagca atatgtcgga cgccttggcc 180 

aacgccgtgt gccagcgctg ccaggcccgc ttctcccccg ccgagcgcat tgtcaacagc 240 

aatggggagc tgtaccatga gcactgcttc gtgtgtgccc agtgcttccg gcccttcccc 3 00 

gaggggctct tctatgagtt tgaaggccgg aagtactgcg aacacgactt ccaaatgctg 3 60 

tttgctccgt gctgtggatc ctgcggtgag ttcatcattg gccgcgtcat caaggccatg 420 

aacaacaact ggcacccggg ctgcttccgc tgcgagctgt gtgatgtgga gctggctgac 480 

ctgggctttg tgaagaatgc cggcaggcat ctctgccggc cttgccacaa ccgtgagaag 54 0 

gccaagggcc tgggcaagta catctgccag cggtgccacc tggtcatcga cgagcagccc 600 

ctcatgttca ggagcgacgc ctaccaccct gaccacttca actgcaccca ctgtgggaag 660 

gagctgacag ccgaggcccg cgagctgaag ggtgagctct actgcctgcc ctgccatgac 720 

aagatgggcg ttcccatctg cggggcctgc cgccggccca tcgagggccg agtggtcaac 780 

gcgctgggca agcagtggca cgtggagcac tttgtctgtg ccaagtgtga gaagccattc 84 0 

ctggggcacc ggcactatga gaagaagggc ctggcctact gcgagcttta gtgcacaatg 900 

ggcggccgcc tcgagtctag actcgagtag ataattgagc ggaatttctt 950 

<210> 659 

<211> 2346 

<212> DNA 

<213> Homo sapiens 



<400> 659 



ccgccgtcgc ccccgcctcc 
cccccgggcc ttgcagcggg 
gcgggcagca gcgccgcggc 
aaggagttgc agcggcggct 
ctgcctcggt cctggagccc 
ctgcgggtgc actacaaagg 
acgcatccaa taccagcagc 
ggaagagatg gttacatggg 
ccaggttggg ataagcattc 
tctggaactg gacaacctta 
gttaatctta tcaacaatac 
ttcactgacc taccgccaaa 
gtcgatgcca attttgggca 
tggagaacca aaatccaggc 
tggcagacca tgatacaaaa 
acagcagagg cctttgccag 
aagaatagac aaagaattca 
acaacacaac agttataccc 
aaagtgcgtc agtttataga 
ggccgaagtc caaagtctca 



ccctgcctca gcggctgccc ccgccagcgg 
ccccggcccg gctggagggg ccccgacccc 
ccccttccct cacggggact cggccctgaa 
gaagcgtctc tacccggccg tggacgaaca 
gaaggacaag ttcagctaca tcggcctctc 
tcatggcaaa accccaaaag atgccgcgtc 
ctgtgggatt tattattttg aagtaaaaat 
aattggtctt tctgctcaag gtgtgaacat 
atatggttac catggggatg atggacattc 
tggaccaact ttcactactg gtgatgtcat 
ctgcttttac accaagaatg gacatagttt 
tttgtatcct actgtggggc ttcaaacacc 
acatcctttc gtgtttgata tagaagacta 
acagatagat cgatttccta tcggagatcg 
aatggtttca tcttatttag tccaccatgg 
atctacagac cagaccgttc tagaagaatt 
gaaattggta ttagcaggaa gaatgggaga 
aagtttactt gaaagaaatc ctaatctcct 
aatggtgaat ggtacagata gtgaagtacg 
agacagttat cctgttagtc ctcgaccttt 



gccgcccgct 
agctctggtg 
cgagcaggag 
agagacgccg 
tcagaacaac 
agttcgagcc 
tgtcagtaag 
gaatagacta 
gttttgttct 
tggctgttgt 
aggtattgct 
aggagaagtg 
tatgcgggag 
agaaggagaa 
gtactgtgcc 
agcttccatt 
agccattgaa 
tttcacatta 
atgtttggga 
tagtagtcca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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agtatgagcc ccagccatgg aatgaatatc cacaatttag catcaggcaa aggaagcacc 1260 
gcacattttt caggttttga aagttgtagt aatggtgtaa tatcaaataa agcacatcaa 1320 
tcatattgcc atagtaataa acaccagtca tccaacttga atgtaccaga actaaacagt 
ataaatatgt caagatcaca gcaagttaat aacttcacca gtaatgatgt agacatggaa 
acagatcact actccaatgg agttggagaa acttcatcca atggtttcct aaatggtagc 
tctaaacatg accacgaaat ggaagattgt gacaccgaaa tggaagttga ttcaagtcag 
ttgagacgcc agttgtgtgg aggaagtcag gccgccatag aaagaatgat ccactttgga 
cgagagctgc aagcaatgag tgaacagcta aggagagact gtggcaagaa cactgcaaac 1680 
aaaaaaatgt tgaaggatgc attcagtcta ctagcatatt cagatccctg gaacagccca 1740 
gttggaaatc agcttgaccc gattcagaga gaacctgtgt gctcagctct taacagtgca 1800 
atattagaaa cccacaatct gccaaagcaa cctccacttg ccctagcaat gggacaggcc 1860 
acacaatgtc taggactgat ggctcgatca ggaattggat cctgcgcatt tgccacagtg 1920 
gaagactacc tacattagct atgcatttca agagctcaca cttatattgt ggcatatagt 1980 
caacatggaa gtagaccagc tctgctgatt tgaaatttag attttttaaa ttatgtactg 2 040 
gggacaggtt tttgtcgctt tacattgctt cctagtttac agcatgatgc aaatgatttt 2100 
ctaacttagt gttaggagaa attattttcc atctttaacc tcttagttgt ctaagagtta 2160 
aatattactg aatttcagac gttcaaattg atcatcacaa atcctttaaa acaattacct ■ ^ 

aaaagaaacc aaaaatcctg ccttctttgt gggggagggg ggagagaggg gaaggaaatg 
gaacaagttg tgtttgtgtt agcatgtggg tgatgtaaac ttcaaattgg gagatgttcc 
gacccc 



<210> 660 

<211> 2512 

<212> DNA 

<213> Homo sapiens 



1380 
1440 
1500 
1560 
1620 



2220 
2280 
2340 
2346 



60 



180 
240 
300 



<400> 660 

caatgcactg acggatatga gtgggatcct gtgagacagc aatgcaaaga tattgatgaa 

tgtgacattg tcccagacgc ttgtaaaggt ggaatgaagt gtgtcaacca ctatggagga 120 

tacctctgcc ttccgaaaac agcccagatt attgtcaata atgaacagcc tcagcaggaa ' ™ 
acacaaccag cagaaggaac ctcaggggca accaccgggg ttgtagctgc cagcagcatg 
gcaaccagtg gagtgttgcc cgggggtggt tttgtggcca gtgctgctgc agtcgcaggc 

cctgaaatgc agactggccg aaataacttt gtcatccggc ggaacccagc tgaccctcag 360 

cgcattccct ccaacccttc ccaccgtatc cagtgtgcag caggctacga gcaaagtgaa 420 

cacaacgtgt gccaagacat agacgagtgc actgcaggga cgcacaactg tagagcagac 4 80 

caagtgtgca tcaatttacg gggatccttt gcatgtcagt gccctcctgg atatcagaag 540 

cgaggggagc agtgcgtaga catagatgaa tgtaccatcc ctccatattg ccaccaaaga 600 
tgcgtgaata caccaggctc attttattgc cagtgcagtc ctgggtttca attggcagca 
aacaactata cctgcgtaga tataaatgaa tgtgatgcca gcaatcaatg tgctcagcag 
tgctacaaca ttcttggttc attcatctgt cagtgcaatc aaggatatga gctaagcagt 

gacaggctca actgtgaaga cattgatgaa tgcagaacct caagctacct gtgtcaatat 84 0 

caatgtgtca atgaacctgg gaaattctca tgtatgtgcc cccagggata ccaagtggtg 900 

agaagtagaa catgtcaaga tataaatgag tgtgagacca caaatgaatg ccgggaggat 96 0 

gaaatgtgtt ggaattatca tggcggcttc cgttgttatc cacgaaatcc ttgtcaagat 1020 

ccctacattc taacaccaga gaaccgatgt gtttgcccag tctcaaatgc catgtgccga 1080 

gaactgcccc agtcaatagt ctacaaatac atgagcatcc gatctgatag gtctgtgcca 1140 

tcagacatct tccagataca ggccacaact atttatgcca acaccatcaa tacttttcgg 1200 

attaaatctg gaaatgaaaa tggagagttc tacctacgac aaacaagtcc tgtaagtgca 1260 

atgcttgtgc tcgtgaagtc attatcagga ccaagagaac atatcgtgga cctggagatg 1320 

ctgacagtca gcagtatagg gaccttccgc acaagctctg tgttaagatt gacaataata 13 80 

gtggggccat tttcatttta gtcttttcta agagtcaacc acaggcattt aagtcagcca 144 0 

aagaatattg ttaccttaaa gcactatttt atttatagat atatctagtg catctacatc 1500 

tctatactgt acactcaccc ataacaaaca attacaccat ggtataaagt gggcatttaa 1560 

tatgtaaaga ttcaaagttt gtctttatta ctatatgtaa attagacatt aatccactaa 162 0 



660 
720 
780 
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actggtcttc 
taaaagtggg 
cactaacagt 
atgagttttt 
aaaatgggga 
tgtttattat 
taattttgtc 
cagaattccc 
ggaggatatg 
ctcggaaaat 
taaaaggtat 
actaatttat 
tgatggtttt 
ccaattctat 
gtatcatatt 



ttcaagagag ctaagtatac actatctggt 
accaagcaat gatgatcttc tgtggtgctt 
ctcataagga ggcagccatc ataaccattg 
aactgctttg taagaaaatg gaaaaggtca 
tctgccatat ttgtgttggt ttttattttc 
atagtaataa atcattgctg tacaacatgc 
agaaatttta gattgtgaat attttgtaaa 
aaaatgaacc agataccccc tagaaaatta 
agaaaataaa ttccttctaa accacattgg 
ataataacat ccctgaattc aggcattcac 
ttcactggag aagttttaat ttctaagtaa 
aactaaaatt tctcatcttc gtacttgatg 
tattcctggc atccagagtg acagtgaact 
ggaatatttt atacgtctcc ttgtttaaaa 
tttaaataaa aataaatatt cctttagaag 



gaaacttgga ttctttccta 
aaggaaactt actagagctc 
aatagcatgc aagggtaaga 
ataaagatat atttctttag 
atatccagcc taaaggtggt 
tggtttctgt agggtatttt 
aaacagtaag caaaattttc 
tactattgag aaatctatgg 
aactgacctg aagaagcaaa 
aagatgcaga acaaaatgga 
aatttaaatc ctaacacttc 
ctcacagagg aagaaaatga 
taagcaaatt accctcctac 
tctgactgct ttactttgat 
atcactctaa aa 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2512 



<210> 661 

<211> 2306 

<212> DNA 

<213> Homo sapiens 

<400> 661 

gggcgggagc tgcacgcgcc gtggctccgg atctcttcgt ctttgcagcg tacgcccgag 60 

tcggtcagcg ccggaggacc tcagcagcca tgtcgaagcc ccatagtgaa gccgggactg 120 

ccttcattca gacccagcag ctgcacgcag ccatggctga cacattcctg gagcacatgt 180 

gccgcctgga cattgattca ccacccatca cagcccggaa cactggcatc atctgtacca 240 

ttggcccagc ttcccgatca gtggagacgt tgaaggagat gattaagtct ggaatgaatg 3 00 

tggctcgtct gaacttctct catggaactc atgagtacca tgcggagacc atcaagaatg 3 60 

tgcgcacagc cacggaaagc tttgcttctg acccctacct ctaccggccc gttgctgtgg 420 

ctctagacac taaaggacct gagatccgaa ctgggctcat caagggcagc ggcactgcag 480 

agctggagct gaagaaggga gccactctca aaatcacgct ggataacgcc tacatggaaa 540 

agtgtgacga gaacatcctg tggctggact acaagaacat ctgcaaggtg gtggaagtgg 600 

gcagcaagat ctacgtggat gatgggctta tttctctcca ggtgaagcag aaaggtgccg 660 

acttcctggt gacggaggtg gaaaatggtg gctccttggg cagcaagaag ggtgtgaacc 720 

ttcctggggc tgctgtggac ttgcctgctg tgtcggagaa ggacatccag gatctgaagt 780 

ttggggtcga gcaggatgtt gatatggtgt ttgcgtcatt catccgcaag gcatctgatg 840 

tccatgaagt taggaaggtc ctgggagaga agggaaagaa catcaagatt atcagcaaaa 900 

tcgagaatca tgagggggtt cggaggtttg atgaaatcct ggaggccagt gatgggatca 960 

tggtggctcg tggtgatcta ggcattgaga ttcctgcaga gaaggtcttc cttgctcaga 102 0 

agatgatgat tggacggtgc aaccgagctg ggaagcctgt catctgtgct actcagatgc 1080 

tggagagcat gatcaagaag ccccgcccca ctcgggctga aggcagtgat gtggccaatg 114 0 

cagtcctgga tggagccgac tgcatcatgc tgtctggaga aacagccaaa ggggactatc 1200 

ctctggaggc tgtgcgcatg cagcacctga ttgcccgtga ggcagaggct gccatctacc 12 60 

acttgcaatt atttgaggaa ctccgccgcc tggcgcccat taccagcgac cccacagaag 132 0 

ccaccgccgt gggtgccgtg gaggcctcct tcaagtgctg cagtggggcc ataatcgtcc 13 80 

tcaccaagtc tggcaggtct gctcaccagg tggccagata ccgcccacgt gcccccatca 144 0 

ttgctgtgac ccggaatccc cagacagctc gtcaggccca cctgtaccgt ggcatcttcc 1500 

ctgtgctgtg caaggaccca gtccaggagg cctgggctga ggacgtggac ctccgggtga 1560 

actttgccat gaatgttggc aaggcccgag gcttcttcaa gaagggagat gtggtcattg 162 0 

tgctgaccgg atggcgccct ggctccggct tcaccaacac catgcgtgtt gttcctgtgc 1680 

cgtgatggac cccagagccc ctcctccagc ccctgtccca cccccttccc ccagcccatc 1740 

cattaggcca gcaacgcttg tagaactcac tctgggctgt aacgtggcac tggtaggttg 1800 

ggacaccagg gaagaagatc aacgcctcac tgaaacatgg ctgtgtttgc agcctgctct 1860 

agtgggacag cccagagcct ggctgcccat catgtggccc cacccaatca agggaagaag 192 0 
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gaggaatgct ggactggagg cccctggagc cagatggcaa gagggtgaca gcttcctttc 198 0 

ctgtgtgtac tctgtccagt tcctttagaa aaaatggatg cccagaggac tcccaaccct 2040 

ggcttggggt caagaaacag ccagcaagag ttaggggcct tagggcactg ggctgttgtt 2100 

ccattgaagc cgactctggc cctggccctt acttgcttct ctagctctct aggcctctcc 2160 

agtttgcacc tgtccccacc ctccactcag ctgtcctgca gcaaacactc caccctccac 2220 

cttccatttt cccccactac tgcagcacct ccaggcctgt tgctatagag cctacctgta 22 80 

tgtcaataaa caacagctga agcacc 23 06 

<210> 662 

<211> 2656 

<212> DNA 

<213> Homo sapiens 



<400> 662 

gaattcgcgg ccgcagagtc cccgggccaa gatggctgcg cggtgctcca cacgctggtt 60 

gctggtggtt gtggggaccc cgcggctgcc ggctatatcg ggtagagggg cccggccgcc 120 

cagggagggc gtggtggggg catggctgag ccgcaagctg agcgtccccg cctttgcgtc 180 

ttccctgacc tcttgcggcc cccgagcgct gctgacattg agacctggtg tcagccttac 24 0 

aggaacaaaa cataaccctt tcatttgtac tgcctccttc cacacgagtg cccctttggc 300 

caaagaagat tattatcaga tattaggagt gcctcgaaat gccagccaga aagagatcaa 360 

gaaagcctat tatcagcttg ccaagaagta tcaccctgac acaaataagg atgatcccaa 42 0 

agccaaggag aagttctccc agctggcaga agcctatgag gttttgagtg atgaggtgaa 4 80 

gaggaagcag tacgatgcct acggctctgc aggcttcgat cctggggcca gcggctccca 54 0 

gcatagctac tggaagggag gccccactgt ggaccccgag gagctgttca ggaagatctt 600 

tggcgagttc tcatcctctt catttggaga tttccagacc gtgtttgatc agcctcagga 660 

atacttcatg gagttgacat tcaatcaagc tgcaaagggg gtcaacaagg agttcaccgt 72 0 

gaacatcatg gacacgtgtg agcgctgcaa cggcaagggg aacgagcccg gcaccaaggt 780 

gcagcattgc cactactgtg gcggctccgg catggaaacc atcaacacag gcccttttgt 84 0 

gatgcgttcc acgtgtagga gatgtggtgg ccgcggctcc atcatcatat cgccctgtgt 900 

ggtctgcagg ggagcaggac aagccaagca gaaaaagcga gtgatgatcc ctgtgcctgc 960 

aggagtcgag gatggccaga ccgtgaggat gcctgtggga aaaagggaaa ttttcattac 102 0 

gttcagggtg cagaaaagcc ctgtgttccg gagggacggc gcagacatcc actccgacct 1080 

ctttatttct atagctcagg ctcttcttgg gggaacagcc agagcccagg gcctgtacga 1140 

gacgatcaac gtgacgatcc cccctgggac tcagacagac cagaagattc ggatgggtgg 12 00 

gaaaggcatc ccccggatta acagctacgg ctacggagac cactacatcc acatcaagat 1260 

acgagttcca aagaggctaa cgagccggca gcagagcctg atcctgagct acgccgagga 132 0 

cgagacagat gtggagggga cggtgaacgg cgtcaccctc accagctctg gtggcagcac 1380 

catggatagc tccgcaggaa gcaaggctag gcgtgaggct ggggaggacg aggagggatt 144 0 

cctttccaaa cttaagaaaa tgtttacctc atgatatccc agccgaggaa aaagatccac 1500 

tggaaactag gccgggaagc agcagcccct ccaagggcca gggcacctgg gagacgggag 1560 

gattccagaa cagcagcact gagctcccac ccgcagagcc tctggacggc cttggcaaca 162 0 

gcaaaatcat gggacaacac ctctctccac ggaaaggtca cagtggacag cccgggcagt 1680 

aggatgcagc cccagaggct ggtggcagtt tcctgtccat tggtaggtga cggccccctg 1740 

gtcagcagag gagaggttag atcttgcagg ctaaaactct aatttggaat tgaatattgt 1800 

ggatatctta gttaaaggcc atgcttacag cttagaaatg aagccttaag ctgcatcaag 1860 

ttacgaagtg attaatttcc ttctcagcaa acctccggga ggttccagaa tgagttcttc 1920 

ctgacaggtt gtcttcactg ggagcgtggg gcccccaggc cccaccagca ccgtcctccc 1980 

ctaatgaggg gccctgccga ggcatcagct gctctgctca gttagttttt attcccgggg 2 04 0 

taccaagcag ctgcacagtc ggtgcctggg aagcacgtta aaggcccaga gagatcctgg 2100 

gggttctgct ctgaccgtgt gggtggtgat ccttgtcagg atgtacagtc cttgctccca 2160 

ccccatccgg gatggccgcc tgtccctgac tattgagtcc tgttgttgta agccaggcat 2220 

ggagggctcc tgcccttctg ctgagccaca gcccattgca gcactgtgct ggccagactt 22 80 

cagctgcctt gggaactgaa gccctgccac tgttgctagt caggggcttg gttctcccac 2340 

ttacactgtt gacatctatt ttctgaagtg tgtttaaatt attcagtgct aatcattgtt 24 00 
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ttttcctttg taaatgttga ttcagaaaag gaaagcacag gctaagcagt tgaaggttcc 2460 

ccaccattca gtgagagcag aacccccatt ccccagcctc tgctggtagc atgtcgcagt 252 0 

ttccatgtgt ttcaggatct tcgggctgtc gttagacagg ttaatgaaga acacttctca 2580 

acagtttcct ttttgttttc ctttataatt cactaaaata aagcatctat tagtgtctga 2640 

aaaaaaaaaa aaaaaa 265 6 



<210> 663 
<211> 2217 
<212> DNA 

<213> Homo sapiens 



<400> 663 

gcggacccgg cgccgaggcg gccacccgag acgcggcgcg cacgctccgg cctgcgcagc 60 

ccggcccggc catggcggcc ccccgcccgt ctcccgcgat ctccgtttcg gtctcggctc 12 0 

cggcttttta cgccccgcag aagaagttcg gccctgtggt ggccccaaag cccaaagtga 180 

atcccttccg gcccggggac agcgagcctc ccccggcacc cggggcccag cgcgcacaga 24 0 

tgggccgggt gggcgagatt cccccgccgc ccccggaaga ctttcccctg cctccacctc 3 00 

cccttgctgg ggatggcgac gatgcagagg gtgctctggg aggtgccttc ccgccgcccc 3 60 

ctcccccgat cgaggaatca tttccccctg cgcctctgga ggaggagatc ttcccttccc 42 0 

cgccgcctcc tccggaggag gagggagggc ctgaggcccc cataccgccc ccaccacagc 4 80 

ccagggagaa ggtgagcagt attgatttgg agatcgactc tctgtcctca ctgctggatg 54 0 

acatgaccaa gaatgatcct ttcaaagccc gggtgtcatc tggatatgtg cccccaccag 60 0 

tggccactcc attcagttcc aagtccagta ccaagcctgc agccgggggc acagcacccc 660 

tgcctccttg gaagtcccct tccagctccc agcctctgcc ccaggttccg gctccggctc 72 0 

agagccagac acagttccat gttcagcccc agccccagcc caagcctcag gtccaactcc 780 

atgtccagtc ccagacccag cctgtgtctt tggctaacac ccagccccga gggcccccag 840 

cctcatctcc ggctccagcc cctaagtttt ctccagtgac tcctaagttt actcctgtgg 900 

cttccaagtt cagtcctgga gccccaggtg gatctgggtc acaaccaaat caaaaattgg 960 

ggcaccccga agctctttct gctggcacag gctcccctca acctcccagc ttcacctatg 102 0 

cccagcagag ggagaagccc cgagtgcagg agaagcagca ccccgtgccc ccaccggctc 1080 

agaaccaaaa ccaggtgcgc tcccctgggg ccccagggcc cctgactctg aaggaggtgg 114 0 

aggagctgga gcagctgacc cagcagctaa tgcaggacat ggagcatcct cagaggcaga 12 00 

atgtggctgt caacgaactc tgcggccgat gccatcaacc cctggcccgg gcgcagccag 1260 

ccgtccgcgc tctagggcag ctgttccaca tcgcctgctt cacctgccac cagtgtgcgc 1320 

agcagctcca gggccagcag ttctacagtc tggagggggc gccgtactgc gagggctgtt 13 8 0 

acactgacac cctggagaag tgtaacacct gcggggagcc catcactgac cgcatgctga 144 0 

gggccacggg caaggcctat cacccgcact gcttcacctg tgtggtctgc gcccgccccc 1500 

tggagggcac ctccttcatc gtggaccagg ccaaccggcc ccactgtgtc cccgactacc 1560 

acaagcagta cgccccgagg tgctccgtct gctctgagcc catcatgcct gagcctggcc 162 0 

gagatgagac tgtgcgagtg gtcgccctgg acaagaactt ccacatgaag tgttacaagt 168 0 

gtgaggactg cgggaagccc ctgtcgattg aggcagatga caatggctgc ttccccctgg 1740 

acggtcacgt gctctgtcgg aagtgccaca ctgctagagc ccagacctga gtgaggacag 18 00 

gccctcttca gaccgcagtc catgccccat tgtggaccac ccacactgag accacctgcc 1860 

cccacctcag ttattgtttt gatgtctagc ccctcccatt tccaacccct ccctagcatc 192 0 

ccaggtgccc tgacccagga cccaacatgg tctagggatg caggatcccc gccctggggt 1980 

ctggtcctcg cccatcctgc agggattgcc caccgtcttc cagacacccc acctgagggg 2 040 

ggcaccaggt ttagtgctgc tgctttcact gctgcacccg cgccctcggc cggccccccg 210 0 

agcagccttt gtactctgct tgcggagggc tgggagaccc tccaggacat tcccaccctc 2160 

ccccatgctg ccaagttgta gctatagcta caaataaaaa aaaaccttgt tttccag 2217 
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<400> 664 

gaggcggcca 

gatgaaatga 

ggtgggagag 

agaatacgac 

gaggtctgca 

atcaagcccc 

atcatcaagc 

gaatttgaga 

ctaactcctg 

gttgcttttg 

caggacatcc 

gaggaacatt 

gagggacaga 

ggctcaggca 

ctgcactcga 

cagtgcttcc 

cacaacaaag 

atggagcttc 

ataaaccgca 

cagatcaggc 

gatgagcgga 

atgagcaacg 

caggacctag 

tattctgagg 

cccggcagct 

ctggatgtca 

cgcaaggtcc 

gtgctagact 

aaatctcttc 

cagaacacat 

ggggaatgtg 

gatttcgttc 
acccatgtga 
gtgtatgccg 
cagaccaccc 
ctcagggact 
gagacggtgc 
cgcaagatca 
attatctcag 
acagaagatc 
atgaacaact 
gaggtccagg 
gtccaggacc 
cagcatcgga 
caggtgctgg 
agctcgttac 
gagaaaacac 
aaccactacg 
cagctcatgc 
aaaacctcag 
gaagactggt 
cccatgatca 
cggaggaatg 



aggacctggc 
aagctatgga 
ataaacgtgg 
aggaggatct 
agcgtggctt 
ttctgaagat 
cagacaactt 
caaatatggt 
agtttgatgg 
aagactacat 
tagctaagaa 
ctcagctgaa 
agctgcttca 
atgcggacct 
cacggcagca 
agctgaggct 
gcctgtttct 
agacgcagca 
tcatgtcggt 
agatcgcgag 
gcaccttgct 
tggattcatg 
aagatgccat 
tcagccaaga 
ccgattccct 
tccacgaggt 
ggctgcatca 
ggatcgagaa 
atcgggccag 
acaccaatgc 
accccgaaga 
ggcgtgttga 
aagagctgtg 
agtcggtgga 
tgcaggtgac 
ctgccatctc 
tgcagcagct 
agctggagct 
acctcgagtc 
tcacgattgc 
tgacttttga 
cctctggtgt 
tgctggagtt 
aacacctgga 
gttggatccg 
aagaggcaga 
atcagagcgc 
acatggacat 
tcaagatgga 
agcaggtctg 
gtggcggggc 
gcaagcacct 
cagacgtctt 



cgacatcgcg 
tgttttacca 
aggtcccatt 
caggagactc 
cacggtgatc 
cctgcaggag 
ctggcagaaa 
ctctttagaa 
ctgcctggaa 
tagcaatgcc 
ggagctgcct 
gaagaaggtg 
gaggatacag 
gcagaacctc 
tctgcaccag 
gtttgaacag 
aaacagctac 
caatcacttt 
ggccaatcgt 
tcagctggag 
ggacatgtcc 
gtgtaaagct 
tcatcaccac 
tgggaagtcg 
gacagcctct 
gctgcaccac 
gaggctgcag 
ccacggagaa 
agcattgcag 
ggataaatta 
gatttatcag 
gcagcgaaag 
gacgtggctg 
ggccgtgcag 
tgtcaacgtg 
cagtaacaag 
ggacgaggcg 
cttcctgcac 
ttggaatgat 
agagcagcgc 
cgtcatccac 
ggagctgctg 
tcttcatgaa 
gcagtgcgtg 
caacggagag 
gcagctccag 
gctgcaggtg 
gatccgggac 
agatcgcctc 
cagcgtcctc 
ggataagctg 
ggagcagaag 
cctgaaatac 



gccttcttcc 
attttgaagg 
ttaacgtttc 
atttcctatc 
gtggacatgc 
tccttcccct 
cagaggacta 
ggccttacca 
tacaaccacg 
acccacatgc 
caggatttag 
attaaggccc 
agcagtgaaa 
ttgcccaagg 
atgtggcatg 
gatgctgaga 
acagagattg 
gccatgaact 
ctggtggagt 
caggagtgga 
tccattttcc 
tgcggtgagg 
cagggaatat 
ctccttgaca 
gccaactact 
cagcggcacg 
ctgtgtgttt 
gcatttctga 
aaacgtcatg 
ctggaagcag 
gctgcccatc 
atcctactgg 
gaggagctgc 
gacctcatca 
atcaaggaag 
accccccaca 
cagtcgcaga 
gtgcgcatct 
gagctttctc 
ctccagcacc 
caagggcaag 
tgtgatagag 
aaacagcagg 
cagctgcgcc 
tccatgttaa 
cgagagcacg 
cagcagaagg 
tgcgccgaga 
aagctcgtca 
gagagcctgg 
ggcccaaact 
gaggcattcc 
ctgcacagga 



gatccgggtt 

aaaaagttgc 

cggcccgcag 

tagcctgtat 

gtgggtccaa 

gctgcatcca 

attttggcag 

aagtagttga 

aagaatggat 

tgtctcggct 

agggggctcg 

ccatcgagga 

gctttcccaa 

tgtccaccat 

tgaggaagct 

agatgtttga 

ggaccagcca 

gtatgaacgt 

ctggccacta 

aggcgtttgc 

accagaaggc 

tagaccttcc 

atgaacatat 

agctccagcg 

ccaaggccgt 

tgagaacaat 

tccagcagga 

gcaaacatac 

aagattttga 

cagaacagct 

agctggaaga 

acatgtcagt 

agaaggagct 

agcgctttgg 

gggaggacct 

acagctccat 

tggaggagct 

tcgagaggga 

agcaaatgaa 

atgcagacaa 

atcttctgca 

atgtagacat 

aattggattt 

acctgcaggc 

atgccggact 

agcagttcca 

cagaagccat 

aggtggcgtc 

acgcctcfcgt 

aacaggagta 

ctgagacgga 

tgaaggcttg 

acagcgtgaa 



tcgaaaaaac 
atacctttca 
caatcatgac 
tcccagcgag 
gtgggactcc 
tgtggccctg 
ttctaaattt 
tccttctcag 
tgaaatcaga 
ggaggaactt 
gaatatgatc 
cctggatttg 
aaagaactca 
gctggaccgg 
gaagctggac 
ctggatcaca 
ccctcatgcc 
gtatgtaaat 
tgcctcgcag 
ggcagccctg 
cgaaaagtat 
ctcagagctg 
cactcttgct 
gcccttgact 
gcaccatgtc 
ctggcaacac 
agttcagcag 
aggtgtgggg 
agaagtggca 
ggctcagact 
ccggattcaa 
gtcctttcac 
gctggacgac 
ccagcagcag 
catccagcag 
caaccacatt 
cttccaggag 
cgccatcgac 
tgacttcgac 
agccttgacc 
gtatgtcaat 
ggcaactcgg 
agccgcagag 
agaagtgaaa 
tatcacagcc 
gcatgccatt 
gctacaggcc 
tcactggcaa 
cgctttctac 
caagagagaa 
ccacgtgacg 
cacccttgct 
catgccagga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
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atggtgacgc acatcaaagc tcctgaacag caagtgaaaa atatcttgaa tgaactcttc 
caacgggaga acagggtatt gcattactgg accatgagga agagacggct ggaccagtgt 
cagcagtacg tggtctttga gaggagtgcc aagcaggctt tggaatggat ccatgacaat 
ggcgagttct acctttccac acacacctcc acgggctcca gtatacagca cacccaggag 
ctcctgaaag agcacgagga gttccagata actgcaaagc aaaccaaaga gagagtgaag 
ctattgatac agctggctga tggcttttgt gaaaaagggc atgcccatgc ggcagagata 
aaaaaatgtg ttactgctgt ggataagagg tacagagatt tctctctgcg gatggagaag 
tacaggacct ctttggaaaa agccctgggg atttcttcag attccaacaa atcgagtaaa 
agtctccagc tagatatcat tccagccagt atccctggct cagaggtgaa acttcgagat 
gctgctcatg aacttaatga agagaagcgg aaatctgccc gcaggaaaga gttcataatg 
gctgagctca ttcaaactga aaaggcttat gtaagagacc tccgggaatg tatggatacg 
tacctgtggg aaatgaccag tggcgtggaa gagattccac ctggcattgt aaacaaagaa 
ctcatcatct tcggaaacat gcaagaaatc tacgaatttc ataataacat attcctaaag 
gagctggaaa aatatgaaca gttgccagag gatgttggac attgttttgt tacttgggca 
gacaagtttc agatgtatgt cacatattgc aaaaataagc ctgattctac tcagctgata 
ttggaacatg cagggtccta ttttgacgag atacagcagc gacatggatt agccaattcc 
atttcttcct accttattaa accagttcag cgaataacga aatatcagct ccttttaaaa 
gagctgctga cgtgctgtga ggaaggaaag ggagagatta aagatggcct ggaggtgatg 
ctcagcgtgc cgaagcgagc caatgacgcc atgcacctca gcatgctgga agggtttgat 
gaaaacattg agtctcaggg agaactcatc ctacaggaat ccttccaagt gtgggaccca 
aaaaccttaa ttcgaaaggg tcgagaacgg catctcttcc tttttgaaat gtccttagta 
tttagtaaag aagtgaaaga ttccagtggg agaagcaagt acctttataa aagcaaattg 
tttacctcag agttgggtgt cacagaacat gttgaaggag acccttgcaa atttgcactg 
tgg9tgggga gaacaccaac ttcagataat aaaattgtcc ttaaggcttc cagcatagag 
aacaagcagg actggataaa gcatatccgc gaagtcatcc aggagcggac gatccacctg 
aagggagccc tgaaggagcc cattcacatc cctaagaccg ctcccgccac aagacagaag 
ggaaggaggg atggagagga tctggacagc caaggagacg gcagcagcca gcctgatacg 
atttccatcg cctcacggac gtctcagaac acgctggaca gcgataagct ctctggtggc 
tgtgagctga cagtggtgat ccatgacttc accgcttgca acagcaacga gctgaccatc 
cgacggggcc agaccgtgga agttctggag cggccgcatg acaagcctga ctggtgtctg 
gtgcggacca ctgaccgctc cccagcggca gaaggcctgg tcccctgtgg ttcactgtgc 
atcgcccact ccagaagtag catggaaatg gagggcatct tcaaccacaa agactcgctc 
tccgtctcca gcaatgacgc cagtccaccc gcatccgtgg cttccctcca gccccacatg 
atcggggccc agagctcgcc gggccccaag cggccgggca acaccctgcg caagtggctc 
accagccccg tgcggcggct cagcagcggc aaggccgacg ggcacgtgaa gaagctggcg 
cacaagcaca agaagagccg cgaggtccgc aagagcgccg acgccggctc gcagaaggac 
tccgacgaca gtgcggccac cccgcaggac gagacggtcg aggagagagg ccggaacgag 
ggcctgagca gcggtactct ctccaaatcc tcctcctcgg ggatgcagag ctgtggagaa 
gaggaaggcg aggagggggc cgacgccgtg cccctgccgc cacccatggc catccagcag 
cacagcctcc tccagccaga ctcacaggat gacaaggcct cttctcggtt attagtccgc 
cccaccagct ccgaaacacc gagtgcagcc gagctcgtca gtgcaattga ggaactcgtg 
aaaagcaaga tggcactgga ggatcgcccc agctcactcc ttgttgacca gggagatagt 
agcagccctt ccttcaaccc ttcggataat tcccttctct cttcctcctc gcccattgat 
gagatggaag aaaggaaatc cagctcttta aagagaagac actacgtttt gcaagaacta 
gtggagacag agcgtgacta tgtgcgggac cttggctatg tggttgaggg ctacatggca 
cttatgaaag aagatggtgt tcctgatgac atgaaaggaa aagacaaaat tgtgttcggc 
aacatccatc agatttacga ctggcacaga gacttttttt taggagagtt agagaagtgc 
cttgaagatc cagaaaaact aggatccctt tttgttaaac acgagagaag gttgcacatg 
tacatagctt attgtcaaaa taaaccaaag tctgagcaca ttgtctcaga atacattgat 
accttttttg aggacttaaa gcagcgtctt ggccacaggt tacagctcac agatctgttg 
atcaaaccag tgcagagaat catgaagtat cagctgttac tgaaggactt cctcaagtat 
tccaaaaagg ccagcctgga tacatcagaa ttagagagag ctgtggaagt catgtgcata 
gtacccaggc ggtgcaacga catgatgaac gtggggcggc tgcaaggatt cgacgggaaa 
atcgttgccc agggtaaact gctcttgcag gacacattct tggtcacaga ccaagatgca 



3240 
3300 
3360 
3420 
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4800 
4860 
4920 
4980 
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5100 
5160 
5220 
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5400 
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5640 
5700 
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6000 
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ggacttctgc 

agcgaaccac 

atcaaggtga 

ctgacatcga 

gtccggcaaa 

aatgccttga 

ggcagcgggg 

gcggccacag 

cctcccggat 

cgagccaggc 

accatgttgg 

ggagaggtcg 

gccactgacc 

agtgcagtca 

gcactccgtt 

ttagagaacg 

ctcaatccca 

acgtgtgaga 

tcaattacct 

tcctacagtg 

ggcatctaca 

agggtcctag 

ttctacagtg 

cagaaaggaa 

gaccaggtca 

ctcctcgaca 

ggtcgcctcc 

cacctggggg 

gacctaaagc 

gctgactttg 

aaccctgaat 

acgtggagtg 

gacagtgtgg 

tactttaaag 

cccgccaagc 

ggcagaagca 

caccagaatg 

cctagagttt 

taatctgaat 

cgtgtgaaat 

ctgcgctggg 

atcacagtat 



ctcgctgcag 

ttgataaaaa 

gttgcctttg 

ggacgggtga 

cttggatcca 

catcgccaat 

cagcggcggg 

tggcggcccc 

cccccagcct 

agaggcagac 

tgacacacga 

ttcaaattct 

agtgccccgc 

tcgtggagaa 

taaggaaaaa 

gttatcggaa 

actacattta 

caggggagac 

ggaagggccc 

acctgggaga 

cgtgcatcgc 

gtccagggat 

aagtggctga 

ccaagcgagc 

cccatgagct 

cctttgagac 

tggactgcgt 

aggttctgga 

ctgagaatat 

gagatgctgt 

tcgcagcccc 

ttggagtgct 

aagagacctg 

gagtgagcca 

gtccctcggc 

cgggcgtcct 

atgttcgacc 

gacctatcca 

tttcaagaga 

tgcattccaa 

gtggaggacc 

tttttacgga 



agagaggcgc 

gaagggcttc 

cctggaggaa 

cgtggtagag 

tgaaatcaac 

cgagtaccag 

ggtgggggca 

agcagctgcg 

gtccgacacc 

aagatgtcag 

ttacacggca 

ggccagcaac 

agctgagggc 

cccggacggg 

atctgagaaa 

gtcacgggaa 

tgacgttccc 

cgttgttctt 

tgaacacaac 

ggccacgctg 

tgtcaatgac 

ggatgggatc 

gcttggcagg 

agtggccact 

tggcatcctg 

ccccaccagc 

ggtgcgatgg 

agctgtccgg 

cctggtggat 

tcagctcaac 

tgaaatcatc 

cacatacgta 

cctgaacatt 

gaaggccaag 

tgcgctggcc 

cgacacgtcc 

tatccgtagc 

gaagttcttt 

aaacaagcaa 

gtgagctgtg 

gtcacttaca 

tttctg 



atcttcctct 

tccatgccgg 

aatgtggaaa 

accttcattt 

caaattttag 

aggaaccaca 

gcggcggcgg 

gcggcgcccc 

acccccccgt 

agtgaaagca 

gtgaaggagg 

cagcagaaca 

tggattccag 

actctcaaga 

aaagataaag 

ggactcagca 

ccagaattcg 

agatgtcgag 

accttgaaca 

aagattgtgg 

atgggttcag 

atggtgacct 

ggcagattct 

aagtttgtga 

cagagcctcc 

tacatcctgg 

ggaagcctca 

tacctgcaca 

gagagtttag 

acgacctact 

ctcgggaacc 

cttcttagtg 

tgccgcttag 

gagttcgtgt 

ctccaggagc 

agactgactt 

attaaaaact 

ctcattctct 

acataactga 

ctcagcagtg 

ctctgccaag 



ttgagcagat 

gattcctgtt 

atgatccctg 

tgcattcatc 

aaaaccagcg 

gcgggggcgg 

cggggccccc 

cagcacgagc 

gctggtctcc 

gcagcagtag 

atgagatcaa 

tgtttctggt 

gctttgtcct 

agtcaacatc 

acggcaaaag 

acaaggtatc 

tcattccatt 

tctgtggccg 

acgatggtca 

gcgtgaccac 

cctcatcatc 

ggaaagacaa 

ctgtcgttaa 

acaagaagtt 

agcaccccct 

tcttagaaat 

ctgaagggaa 

actgcaggat 

ccaagccaac 

acatccacca 

ctgtctccct 

gcgtgtcccc 

actttagctt 

gcttcctcct 

agtggctgca 

ccttcattga 

ttctgcagag 

ttcacctgcc 

tcagctgccg 

cttggacaca 

gacggaggtc 



cgtcatattc 

taagaacagt 

taaatttgct 

tagtccaagt 

caatttttta 

cggcggcggc 

agtggcggca 

aggagccggc 

tctgcagcct 

caacatctcc 

cgtctaccaa 

gttccgagcc 

gggccacacc 

ttggcacaca 

ggaaggcaag 

tgtgaagctt 

gagtgaggtc 

ccccaaagcc 

ctacagcatc 

ggaagatgac 

ggccagcctg 

ctttgactcc 

gaaatgtgat 

gatgaagcgc 

gcttgtcggc 

ggctgaccag 

gatcagggcg 

agcacacctg 

catcaaactg 

gttactgggg 

gacctcggat 

cttcctggat 

cccagatgac 

gcaggaggac 

ggccggcaac 

gcggcgcaaa 

caggcttctg 

aatcagctgt 

gtatgttcat 

gagctgcaag 

gcattgctgt 
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7500 
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7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 
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8340 

8400 
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8640 

8700 

8760 
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8906 



<210> 665 

<211> 1204 

<212> DNA 

<213> Homo sapiens 

<400> 665 

tcggcggcgg tggtatcggc ggcagctgtg agggggttcc gggaagatgg tgctgatcaa 
ggaattccgt gtggttttgc catgttctgt tcaggagtat caggttgggc agctttactc 
tgttgcagaa gctagtaaga atgagactgg tggtggagaa ggaattgaag tcttaaagaa 
tgaaccttat gagaaggatg gagaaaaggg acagtatacg cacaaaattt atcacctaaa 
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gagcaaagtg cctgcattcg 
gaaagcctgg aatgcgtacc 
tgatttcttc attaaaatcg 
acatggttta gatccaaaca 
tagaagtcaa gttgaaccag 
agtcaagacc aagagaggcc 
tgactgtccc cagatgtgtg 
gcaaagcaaa gtagaaaact 
tcgccagctt ttttgttgga 
aatggaagac gagactcaga 
cacgtcggct gctgatgtct 
ttacgtaatc aaggtcaagt 
tgaccagtat cttgcagtgt 
tgcacaggtt ctcaaccacg 
taaagcgcat gtagagctac 
acatactgaa gggattagta 
caaa 

<210> 666 
<211> 892 
<212> PRT 
<213> Homo sapiens 



<400> 666 
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Trp 


Asp 


Arg 


Asp 


Leu 


Leu 


Leu 


Asp 


Pro 


Ala 


Trp 


Glu 


Lys 


Gin 








20 










25 










30 






Gin 


Arg 


Lys 


Thr 


Phe 


Thr 


Ala 


Trp 


Cys 


Asn 


Ser 


His 


Leu 


Arg 


Lys 


Ala 






35 










40 










45 








Gly 


Thr 


Gin 


He 


Glu 


Asn 


He 


Glu 


Glu 


Asp 


Phe 


Arg 


Asp 


Gly 


Leu 


Lys 




50 










55 










60 










Leu 


Met 


Leu 


Leu 


Leu 


Glu 


Val 


He 


Ser 


Gly 


Glu 


Arg 


Leu 


Ala 


Lys 


Pro 


65 










70 










75 










80 


Glu 


Arg 


Gly Lys 


Met 


Arg 


Val 


His 


Lys 


He 


Ser 


Asn 


Val 


Asn 


Lys 


Ala 










85 










90 










95 




Leu 


Asp 


Phe 


He 


Ala 


Ser 


Lys 


Gly 


Val 


Lys 


Leu 


Val 


Ser 


He 


Gly 


Ala 








100 










105 










110 






Glu 


Glu 


He 


Val 


Asp 


Gly 


Asn 


Val 


Lys 


Met 


Thr 


Leu 


Gly 


Met 


He 


Trp 






115 










120 










125 








Thr 


He 


He 


Leu 


Arg 


Phe 


Ala 


He 


Gin 


Asp 


He 


Ser 


Val 


Glu 


Glu 


Thr 




130 










135 










140 










Ser 


Ala 


Lys 


Glu 


Gly 


Leu 


Leu 


Leu 


Trp 


Cys 


Gin 


Arg 


Lys 


Thr 


Ala 


Pro 


145 










150 










155 










160 


Tyr 


Lys 


Asn 


Val 


Asn 


He 


Gin 


Asn 


Phe 


His 


He 


Ser 


Trp 


Lys 


Asp 


Gly 










165 










170 










175 




Leu 


Gly 


Phe 


Cys 


Ala 


Leu 


He 


His 


Arg 


His 


Arg 


Pro 


Glu 


Leu 


He 


Asp 








180 










185 










190 






Tyr 


Gly 


Lys 


Leu Arg 


Lys 


Asp 


Asp 


Pro 


Leu 


Thr 


Asn 


Leu 


Asn 


Thr 


Ala 






195 










200 










205 








Phe 


Asp 


Val 


Ala 


Glu 


Lys 


Tyr 


Leu 


Asp 


He 


Pro 


Lys 


Met 


Leu 


Asp 


Ala 




210 










215 










220 










Glu 


Asp 


He 


Val 


Gly 


Thr 


Ala 


Arg 


Pro 


Asp 


Glu 


Lys 


Ala 


He 


Met 


Thr 


225 










230 










235 










240 
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tgaggatgat tgctcccgag ggctccttgg tgtttcatga 3 00 

cctactgtag aacaattgta acgaatgaat atatgaaaga 360 

aaacatggca caaaccagac ttgggaacat tagaaaatgt 420 

catggaaaac tgttgaaatt gtccatatag atattgcaga 480 

cagactacaa agctgatgaa gacccagcat tattccagtc 540 

ctttgggacc caactggaag aaggagctgg caaacagccc 600 

cctataagct ggtgaccatc aaattcaagt ggtggggact 660 

tcattcaaaa gcaagaaaaa cggatattta caaacttcca 720 

ttgacaagtg gatcgatctc acgatggaag acattaggag 780 

aagaactaga aacaatgcgt aagaggggtt ccgttcgagg 840 

agatgagtcc cctgtagggt cagagacaat gtcaaactgt 900 

gaggggaaca agcgcagcca gtgatgagtg aacaacaatc 960 

tgacgtttcc cagatgtgtg cttgtgatga tacacacaca 1020 

tgtgtatata tgtatgtgtg catatgtctg tagctgtata 1080 

agatccagat acacacactt gtgtatatat gtacatacag 1140 

caatttctcc aaagtactgt acctatcttc agcaagaatg 1200 

1204 
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Tyr Val Ser Ser 

Thr Ala Ala Asn 
260 

Glu Gin Leu Met 
275 

Trp lie Arg Arg 
290 

Thr Met His Ala 
305 

Arg Leu His Lys 

Asn Phe Asn Thr 
340 

Phe Met Pro Ser 
355 

Gly Cys Leu Glu 
370 

Glu lie Arg Arg 
385 

Gin Lys Ala Ser 

Leu Arg Gin Lys 
420 

Leu Leu Lys Lys 
435 

Asp Arg Val Glu 
450 

Asp Tyr Tyr Asp 
465 

Asp Gin Trp Asp 

Leu Glu Arg Thr 
500 

Glu Tyr Ala Lys 
515 

Met Glu Asp Leu 
530 

Gin Gly Leu Thr 
545 

Ala Asp Lys Glu 

Lys lie Val Gin 
580 

Thr Thr He Thr 
595 

Gin Leu Val Pro 
610 

Gin Gin His Asn 
625 

Val He Gly Pro 

Ser He Glu Met 
660 



Phe Tyr His Ala 
245 

Arg lie Cys Lys 

Glu Asp Tyr Glu 
280 

Thr He Pro Trp 
295 

Met Gin Gin Lys 
310 

Pro Pro Lys Val 
325 

Leu Gin Thr Lys 

Glu Gly Arg Met 
360 

Gin Val Glu Lys 
375 

Leu Glu Arg Leu 
390 

He His Glu Ala 
405 

Asp Tyr Glu Thr 

His Glu Ala Phe 
440 

Gin He Ala Ala 
455 

Ser Pro Ser Val 
470 

Asn Leu Gly Ala 
485 

Glu Lys Leu Leu 

Arg Ala Ala Pro 
520 

Gin Asp Thr Phe 
535 

Thr Ala His Glu 
550 

Arg Leu Ala He 
565 

Thr Tyr His Val 

Pro Gin Glu He 
600 

Arg Arg Asp Gin 
615 

Glu Arg Leu Arg 
630 

Trp He Gin Thr 
645 

His Gly Thr Leu 



Phe Ser Gly Ala 
250 

Val Leu Ala Val 
265 

Lys Leu Ala Ser 

Leu Glu Asn Arg 
300 

Leu Glu Asp Phe 
315 

Gin Glu Lys Cys 
330 

Leu Arg Leu Ser 
345 

Val Ser Asp He 

Gly Tyr Glu Glu 
380 

Asp His Leu Ala 
395 

Trp Thr Asp Gly 
410 

Ala Thr Leu Ser 
425 

Glu Ser Asp Leu 

He Ala Gin Glu 
460 

Asn Ala Arg Cys 
475 

Leu Thr Gin Lys 
490 

Glu Thr He Asp 
505 

Phe Asn Asn Trp 

He Val His Thr 
540 

Gin Phe Lys Ala 
555 

Leu Gly He His 
570 

Asn Met Ala Gly 
585 

Asn Gly Lys Trp 

Ala Leu Thr Glu 
620 

Lys Gin Phe Gly 
635 

Lys Met Glu Glu 
650 

Glu Asp Gin Leu 
665 



Gin Lys Ala Glu 
255 

Asn Gin Glu Asn 
270 

Asp Leu Leu Glu 
285 

Val Pro Glu Asn 

Arg Asp Tyr Arg 
320 

Gin Leu Glu He 
335 

Asn Arg Pro Ala 
350 

Asn Asn Ala Trp 
365 

Trp Leu Leu Asn 

Glu Lys Phe Arg 
400 

Lys Glu Ala Met 
415 

Glu He Lys Ala 
430 

Ala Ala His Gin 
445 

Leu Asn Glu Leu 

Gin Lys He Cys 
480 

Arg Arg Glu Ala 
495 

Gin Leu Tyr Leu 
510 

Met Glu Gly Ala 
525 

He Glu Glu He 

Thr Leu Pro Asp 
560 

Asn Glu Val Ser 
575 

Thr Asn Pro Tyr 
590 

Asp His Val Arg 
605 

Glu His Ala Arg 

Ala Gin Ala Asn 
640 

He Gly Arg He 
655 

Ser His Leu Arg 
670 
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Gin 


Tyr 


Glu 


Lys Ser 


He 


Val 


Asn 


Tyr 


Lys 


Pro 


Lys 


He 


Asp 


Gin 


Leu 






675 








680 










685 








Glu 


Gly 


Asp 


His Gin 


Leu 


He 


Gin 


Glu 


Ala 


Leu 


He 


Phe 


Asp 


Asn 


Lys 




690 








695 










700 










His 


Thr 


Asn 


Tyr Thr 


Met 


Glu 


His 


He 


Arg 


Val 


Gly 


Trp 


Glu 


Gin 


Leu 


705 








710 










715 










720 


Leu 


Thr 


Thr 


He Ala 


Arg 


Thr 


He 


Asn 


Glu 


Val 


Glu 


Asn 


Gin 


He 


Leu 








725 










730 










735 




Thr 


Arg 


Asp 


Ala Lys 


Gly 


He 


Ser 


Gin 


Glu 


Gin 


Met 


Asn 


Glu 


Phe 


Arg 








740 








745 










750 






Ala 


Ser 


Phe 


Asn His 


Phe 


Asp 


Arg Asp His 


Ser 


Gly 


Thr 


Leu 


Gly 


Pro 






755 








760 










765 








Glu 


Glu 


Phe 


Lys Ala 


Cys 


Leu 


He 


Ser 


Leu 


Gly 


Tyr 


Asp 


He 


Gly 


Asn 




770 








775 










780 










Asp 


Pro 


Gin 


Gly Glu 


Ala 


Glu 


Phe 


Ala 


Arg 


He 


Met 


Ser 


He 


Val 


Asp 


785 








790 










795 










800 


Pro 


Asn 


Arg 


Leu Gly 


Val 


Val 


Thr 


Phe 


Gin 


Ala 


Phe 


He 


Asp 


Phe 


Met 








805 










810 










815 




Ser 


Arg 


Glu 


Thr Ala 


Asp 


Thr 


Asp 


Thr 


Ala 


Asp 


Gin 


Val 


Met 


Ala 


Ser 








820 








825 










830 






Phe 


Lys 


He 


Leu Ala Gly Asp 


Lys 


Asn Tyr 


He 


Thr 


Met 


Asp 


Glu 


Leu 






p 








840 










QA C 
O *± _> 








Arg 


Arg 


Glu 


Leu Pro 


Pro 


Asp 


Gin 


Ala 


Glu 


Tyr 


Cys 


He 


Ala 


Arg 


Met 




850 








855 










860 










Ala 


Pro 


Tyr 


Thr Gly 


Pro 


Asp 


Ser 


Val 


Pro 


Gly 


Ala 


Leu 


Asp 


Tyr 


Met 


865 








870 










875 










880 


Ser 


Phe 


Ser 


Thr Ala 


Leu 


Tyr 


Gly Glu 


Ser 


Asp 


Leu 
















885 










890 














<210> 667 


























<211> 197 


























<212> PRT 


























<213> Homo sapiens 
























<400> 667 


























Met 


Met 


Phe 


Pro Gin 


Ser 


Arg 


His 


Ser Gly 


Ser 


Ser 


His 


Leu 


Pro 


Gin 


1 






5 










10 










15 




Gin 


Leu 


Lys 


Phe Thr 


Thr 


Ser 


Asp 


Ser 


Cys 


Asp 


Arg 


He 


Lys 


Asp 


Glu 








20 








25 










30 






Phe 


Gin 


Leu 


Leu Gin 


Ala 


Gin 


Tyr 


His 


Ser 


Leu 


Lys 


Leu 


Glu 


Cys 


Asp 






35 








40 










45 








Lys 


Leu 


Ala 


Ser Glu 


Lys 


Ser 


Glu 


Met 


Gin 


Arg 


His 


Tyr 


Val 


Met 


Tyr 




50 








55 










60 










Tyr 


Glu 


Met 


Ser Tyr 


Gly 


Leu 


Asn 


He 


Glu 


Met 


His 


Lys 


Gin 


Ala 


Glu 


65 








70 










75 










80 


lie 


Val 


Lys Arg Leu Asn Gly 


He 


Cys 


Ala 


Gin 


Val 


Leu 


Pro 


Tyr 


Leu 








85 










90 










95 




Ser 


Gin 


Glu 


His Gin 


Gin 


Gin 


Val 


Leu Gly 


Ala 


He 


Glu 


Arg 


Ala 


Lys 








100 








105 










110 






Gin 


Val 


Thr 


Ala Pro 


Glu 


Leu 


Asn 


Ser 


He 


He 


Arg 


Gin 


Gin 


Leu 


Gin 






115 








120 










125 








Ala 


His 


Gin 


Leu Ser 


Gin 


Leu 


Gin 


Ala 


Leu 


Ala 


Leu 


Pro 


Leu 


Thr 


Pro 




130 








135 










140 










Leu 


Pro 


Val 


Gly Leu 


Gin 


Pro 


Pro 


Ser 


Leu 


Pro 


Ala 


Val 


Ser 


Ala 


Gly 
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145 150 155 160 

Thr Gly Leu Leu Ser Leu Ser Ala Leu Gly Ser Gin Ala His Leu Ser 

165 170 175 

Lys Glu Asp Lys Asn Gly His Asp Gly Asp Thr His Gin Glu Asp Asp 

180 185 190 

Gly Glu Lys Ser Asp 
195 

<210> 668 
<211> 739 
<212> PRT 

<213> Homo sapiens 
<400> 668 

Gly Asp Lys Glu Pro Thr Glu Thr lie Gly Asp Leu Ser lie Cys Leu 

15 10 15 

Asp Gly Leu Gin Leu Glu Ser Glu Val Val Thr Asn Gly Glu Thr Thr 

20 25 30 

Cys Ser Glu Ser Ala Ser Gin Asn Asp Asp Gly Ser Arg Ser Lys Asp 

35 40 45 

Glu Thr Arg Val Ser Thr Asn Gly Ser Asp Asp Pro Glu Asp Ala Gly 

50 55 60 

Ala Gly Glu Asn Arg Arg Val Ser Gly Asn Asn Ser Pro Ser Leu Ser 
65 A 70 75 80 

Asn Gly Gly Phe Lys Pro Ser Arg Pro Pro Arg Pro Ser Arg Pro Pro 

85 90 95 

Pro Pro Thr Pro Arg Arg Pro Ala Ser Val Asn Gly Ser Pro Ser Ala 

100 105 110 

Thr Ser Glu Ser Asp Gly Ser Ser Thr Gly Ser Leu Pro Pro Thr Asn 

115 120 125 

Thr Asn Thr Asn Thr Ser Glu Gly Ala Thr Ser Gly Leu lie He Pro 

130 135 140 

Leu Thr He Ser Gly Gly Ser Gly Pro Arg Pro Leu Asn Pro Val Thr 
145 150 155 160 

Gin Ala Pro Leu Pro Pro Gly Trp Glu Gin Arg Val Asp Gin His Gly 

165 170 175 

Arg Val Tyr Tyr Val Asp His Val Glu Lys Arg Thr Thr Trp Asp Arg 

180 185 190 

Pro Glu Pro Leu Pro Pro Gly Trp Glu Arg Arg Val Asp Asn Met Gly 

195 200 205 

Arg He Tyr Tyr Val Asp His Phe Thr Arg Thr Thr Thr Trp Gin Arg 

210 215 220 

Pro Thr Leu Glu Ser Val Arg Asn Tyr Glu Gin Trp Gin Leu Gin Arg 
225 230 235 240 

Ser Gin Leu Gin Gly Ala Met Gin Gin Phe Asn Gin Arg Phe He Tyr 

245 250 255 

Gly Asn Gin Asp Leu Phe Ala Thr Ser Gin Ser Lys Glu Phe Asp Pro 

260 265 270 

Leu Gly Pro Leu Pro Pro Gly Trp Glu Lys Arg Thr Asp Ser Asn Gly 

275 280 285 

Arg Val Tyr Phe Val Asn His Asn Thr Arg He Thr Gin Trp Glu Asp 

290 295 300 

Pro Arg Ser Gin Gly Gin Leu Asn Glu Lys Pro Leu Pro Glu Gly Trp 
305 ~ 310 315 320 
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Glu 


Met 


Arg 


Phe 


Arg 


Arg 


Thr 


Thr 








340 


Asp 


Asn 


vj JLy 


Pro 






355 




Gin 


Tyr 


Phe 


Arg 




370 






Lys 


lie 


Thr 


Val 


385 








lie 


Met 


Ser 


Phe 


Phe 


Pro 


Gly Glu 








420 


JrXie 


pne 


Leu 


Leu 






435 




Glu 


Tyr 


Ala 


Gly 




450 






Tyr 


He 


Asn 


Pro 


465 








lie 


Ala 


Met 


Ala 


Leu 


Pro 


Phe 


Tyr 








500 


Leu 


blU 


Ser 


He 






515 




Glu 


Asn 


Asn 


He 




530 






Lys 


Glu 


He 


Leu 


545 








Gly 


Asn 


He 


Leu 


Val 


Ala 


Glu 


Trp 








580 


Phe 


Phe 


Glu Gly 






595 




Phe 


Asp 


Ala 


Lys 




610 






Asp 


Leu 


Asn Asp 



625 

Thr Ser Lys Gin 

Asn Glu Lys Arg 
660 

Leu Pro Val Gly 
675 

Lys Phe Cys He 
690 

His Thr Cys Phe 
705 

Gin Leu Lys Glu 
Gly Gin Glu 



Thr Val Asp Gly 
325 

Thr Tyr He Asp 

Gin He Ala Tyr 
360 

Phe Trp Cys Gin 
375 

Thr Arg Lys Thr 
390 

Ser Pro Gin Asp 
405 

Glu Gly Leu Asp 

Ser His Glu Val 
440 

Lys Asp Asn Tyr 
455 

Asp His Leu Lys 
470 

Leu Phe His Gly 
485 

Lys Arg He Leu 

Asp Pro Glu Phe 
520 

Glu Glu Cys Asp 
535 

Gly Glu lie Lys 
550 

Val Thr Glu Glu 
565 

Arg Leu Ser Arg 

Phe Asn Glu He 
600 

Glu Leu Glu Val 
615 

Trp Gin Arg His 
630 

He Met Trp Phe 
645 

Met Arg Leu Leu 

Gly Phe Ala Asp 
680 

Glu Lys Val Gly 
695 

Asn Arg Leu Asp 
710 

Lys Leu Leu Phe 
725 



He Pro Tyr Phe 
330 

Pro Arg Thr Gly 
345 

Val Arg Asp Phe 

Gin Leu Ala Met 
380 

Leu Phe Glu Asp 
395 

Leu Arg Arg Arg 
410 

Tyr Gly Gly Val 
42 5 

Leu Asn Pro Met 

Cys Leu Gin He 
460 

Tyr Phe Arg Phe 
475 

Lys Phe He Asp 
490 

Asn Lys Pro Val 
505 

Tyr Asn Ser Leu 

Leu Glu Met Tyr 
540 

Ser His Asp Leu 
555 

Asn Lys Glu Glu 
570 

Gly Val Glu Glu 
585 

Leu Pro Gin Gin 

Leu Leu Cys Gly 
620 

Ala He Tyr Arg 
635 

Trp Gin Phe Val 
650 

Gin Phe Val Thr 
665 

Leu Met Gly Ser 

Lys Glu Asn Trp 
700 

Leu Pro Pro Tyr 
715 

Ala He Glu Glu 
730 



vai 


Asp 


nlS 


Asn 






335 




Lys 


Ser 


Ala 


Leu 




350 






Lys 


Ala 


Lys 


Val 


365 








Pro 


Gin 


His 


He 


Ser 


Phe 


Gin 


Gin 








400 


Leu 


Trp 


vai 


lie 






415 




Ala 


Arg 


Glu 


Trp 




430 






Tyr 


Cys 


Leu 


Phe 


445 








Asn 


Pro 


Ala 


Ser 


He 


Gly Arg 


Phe 








480 


X 11 XT 


Gly Phe 


Ser 






495 




Gly 


Leu 


Lys 


Asp 




510 






He 


Trp 


Val 


Lys 


525 








Phe 


Ser 


Val 


Asp 


Lys 


Pro 


Asn 


Gly 








560 


Tyr 


He 


Arg 


Met 






575 




Gin 


Thr 


Gin 


Ala 




590 






Tyr 


Leu 


Gin 


Tyr 


605 








Met 


Gin 


Glu 


He 


His 


Tyr Ala 


Arg 








640 


Lys 


Glu 


He 


Asp 






655 




Gly 


Thr 


Cys 


Arg 




670 






Asn 


Gly 


Pro 


Gin 


685 








Leu 


Pro 


Arg 


Ser 


Lys 


Ser 


Tyr 


Glu 








720 


Thr 


Glu Gly 


Phe 






735 





-222- 



BNSDOCID: <WO 02092000 A2_l_> 





WO 02/092000 



PCT/US02/14877 



<210> 669 
<211> 431 
<212> PRT 
<213> Homo sapiens 

<400> 669 

Gly Pro Pro Pro Thr Arg Ala Leu Pro Leu Pro Gin Ser Leu Pro Pro 

15 10 15 

Asp Phe Arg Leu Glu Pro Thr Ala Pro Ala Leu Ser Pro Arg Ser Ser 

20 25 3 0 

Phe Ala Ser Ser Ser Ala Ser Asp Ala Ser Lys Pro Ser Ser Pro Arg 

35 40 45 

Gly Ser Leu Leu Leu Asp Gly Ala Gly Ala Gly Gly Ala Gly Gly Ser 

50 55 60 

Arg Pro Cys Ser Asn Arg Thr Ser Gly lie Ser Met Gly Tyr Asp Gin 
65 70 75 80 

Arg His Gly Ser Pro Leu Pro Ala Gly Pro Cys Leu Phe Gly Pro Pro 

85 90 95 

Leu Ala Gly Ala Pro Ala Gly Tyr Ser Pro Gly Gly Val Pro Ser Ala 

100 105 110 

Tyr Pro Glu Leu His Ala Ala Leu Asp Arg Leu Tyr Ala Gin Arg Pro 

115 120 125 

Ala Gly Phe Gly Cys Gin Glu Ser Arg His Ser Tyr Pro Pro Ala Leu 

130 135 140 

Gly Ser Pro Gly Ala Leu Ala Gly Ala Arg Val Gly Ala Ala Gly Pro 
145 150 155 160 

Leu Glu Arg Arg Gly Ala Gin Pro Gly Arg His Ser Val Thr Gly Tyr 

165 170 175 

Gly Asp Cys Ala Val Gly Ala Arg Tyr Gin Asp Glu Leu Thr Ala Leu 

180 185 190 

Leu Arg Leu Thr Val Gly Thr Gly Gly Arg Glu Ala Gly Ala Arg Gly 

195 200 205 

Glu Pro Ser Gly lie Glu Pro Ser Gly Leu Glu Glu Pro Pro Gly Pro 

210 215 220 

Phe Val Pro Glu Ala Ala Arg Ala Arg Met Arg Glu Pro Glu Ala Arg 
225 230 235 240 

Glu Asp Tyr Phe Gly Thr Cys lie Lys Cys Asn Lys Gly He Tyr Gly 

245 250 255 

Gin Ser Asn Ala Cys Gin Ala Leu Asp Ser Leu Tyr His Thr Gin Cys 

260 265 270 

Phe Val Cys Cys Ser Cys Gly Arg Thr Leu Arg Cys Lys Ala Phe Tyr 

275 280 285 

Ser Val Asn Gly Ser Val Tyr Cys Glu Glu Asp Tyr Leu Phe Ser Gly 

290 295 300 

Phe Gin Glu Ala Ala Glu Lys Cys Cys Val Cys Gly His Leu He Leu 
305 310 315 320 

Glu Lys He Leu Gin Ala Met Gly Lys Ser Tyr His Pro Gly Cys Phe 

325 330 335 

Arg Cys He Val Cys Asn Lys Cys Leu Asp Gly He Pro Phe Thr Val 

340 345 350 

Asp Phe Ser Asn Gin Val Tyr Cys Val Thr Asp Tyr His Lys Asn Tyr 

355 360 365 

Ala Pro Lys Cys Ala Ala Cys Gly Gin Pro He Leu Pro Ser Glu Gly 
370 375 380 
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Cys 


Glu 


Asp 


He 


Val 


Arg Val He Ser Met 


Asp 


Arg 


Asp 


Tyr 


His 


Phe 


385 










390 


395 










400 


Glu 


Cys 


Tyr 


His 


Cys 


Glu Asp Cys Arg Met 


Gin 


Leu 


Ser 


Asp 


Glu 


Glu 










405 


• 410 










415 




Gly 


Cys 


Cys 


Cys 


Phe 


Pro Leu Asp Gly His 


Leu 


Leu 


Cys 


His 


Gly 










420 




425 








430 







<210> 670 
<211> 900 
<212> PRT 

<213> Homo sapiens 
<400> 670 

Gly Pro Gly Ser Arg His His Arg Ala Arg Asp Arg Leu lie His Phe 

15 10 15 

Gly Ala Val Ser Thr Asp Val Leu Gly Cys Ser Ala His Cys Ser Leu 

20 25 30 

Thr Gin Ser Pro Lys Met Asn He Gin Glu Gin Gly Phe Pro Leu Asp 

35 40 45 

Leu Gly Ala Ser Phe Thr Glu Asp Ala Pro Arg Pro Pro Val Pro Gly 

50 55 60 

Glu Glu Gly Glu Leu Val Ser Thr Asp Pro Arg Pro Ala Ser Tyr Ser 
65 70 75 80 

Phe Cys Ser Gly Lys Gly Val Gly He Lys Gly Glu Thr Ser Thr Ala 

85 90 95 

Thr Pro Arg Arg Ser Asp Leu Asp Leu Gly Tyr Glu Pro Glu Gly Ser 

100 105 110 

Ala Ser Pro Thr Pro Pro Tyr Leu Lys Trp Ala Glu Ser Leu His Ser 

115 120 125 

Leu Leu Asp Asp Gin Asp Gly He Ser Leu Phe Arg Thr Phe Leu Lys 

130 135 140 

Gin Glu Gly Cys Ala Asp Leu Leu Asp Phe Trp Phe Ala Cys Thr Gly 
145 150 155 160 

Phe Arg Lys Leu Glu Pro Cys Asp Ser Asn Glu Glu Lys Arg Leu Lys 

165 170 175 

Leu Ala Arg Ala He Tyr Arg Lys Tyr He Leu Asp Asn Asn Gly He 

180 185 190 

Val Ser Arg Gin Thr Lys Pro Ala Thr Lys Ser Phe He Lys Gly Cys 

195 200 205 

He Met Lys Gin Leu lie Asp Pro Ala Met Phe Asp Gin Ala Gin Thr 

210 215 220 

Glu He Gin Ala Thr Met Glu Glu Asn Thr Tyr Pro Ser Phe Leu Lys 
225 230 235 240 

Ser Asp He Tyr Leu Glu Tyr Thr Arg Thr Gly Ser Glu Ser Pro Lys 

245 250 255 

Val Cys Ser Asp Gin Ser Ser Gly Ser Gly Thr Gly Lys Gly He Ser 

260 265 270 

Gly Tyr Leu Pro Thr Leu Asn Glu Asp Glu Glu Trp Lys Cys Asp Gin 

275 280 285 

Asp Met Asp Glu Asp Asp Gly Arg Asp Ala Ala Pro Pro Gly Arg Leu 

290 295 300 

Pro Gin Lys Leu Leu Leu Glu Thr Ala Ala Pro Arg Val Ser Ser Ser 
305 ' 310 315 320 

Arg Arg Tyr Ser Glu Gly Arg Glu Phe Arg Tyr Gly Ser Trp Arg Glu 
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325 



330 



335 



Pro Val Asn Pro Tyr Tyr Val Asn Ala Gly Tyr Ala Leu Ala Pro Ala 

340 345 350 

Thr Ser Ala Asn Asp Ser Glu Gin Gin Ser Leu Ser Ser Asp Ala Asp 

355 360 365 

Thr Leu Ser Leu Thr Asp Ser Ser Val Asp Gly lie Pro Pro Tyr Arg 

370 375 380 

He Arg Lys Gin His Arg Arg Glu Met Gin Glu Ser Ala Gin Val Asn 
385 ~ 390 395 400 

Gly Arg Val Pro Leu Pro His He Pro Arg Thr Tyr Arg Val Pro Lys 

405 410 415 

Glu Val Arg Val Glu Pro Gin Lys Phe Ala Glu Glu Leu He His Arg 

420 425 430 

Leu Glu Ala Val Gin Arg Thr Arg Glu Ala Glu Glu Lys Leu Glu Glu 

435 440 445 

Arg Leu Lys Arg Val Arg Met Glu Glu Glu Gly Glu Asp Gly Asp Pro 

450 455 460 

Ser Ser Gly Pro Pro Gly Pro Cys His Lys Leu Pro Pro Ala Pro Ala 
465 ~ 470 * 475 480 

Trp His His Phe Pro Pro Arg Leu Cys Trp Thr Trp Ala Cys Ala Gly 

485 490 495 

Leu Arg Asp Ala His Glu Glu Asn Pro Glu Ser He Leu Asp Glu His 

500 505 510 

Val Gin Arg Val Leu Arg Thr Thr Gly Arg Gin Ser Pro Gly Pro Gly 

515 520 525 

His Arg Ser Pro Asp Ser Gly His Val Ala Lys Met Pro Val Ala Leu 

530 535 540 

Gly Gly Ala Ala Ser Gly His Gly Lys His Val Pro Lys Ser Gly Ala 
545 550 555 560 

Lys Leu Asp Ala Ala Gly Leu His His His Arg His Val His His His 

565 570 575 

Val His His Ser Thr Ala Arg Pro Lys Glu Gin Val Glu Ala Glu Ala 

580 585 590 

Thr Arg Arg Ala Gin Ser Ser Phe Ala Trp Gly Leu Glu Pro His Ser 

595 600 605 

His Gly Ala Arg Ser Arg Gly Tyr Ser Glu Ser Val Gly Ala Ala Pro 

610 615 620 

Asn Ala Ser Asp Gly Leu Ala His Ser Gly Lys Val Gly Val Ala Cys 
625 630 635 640 

Lys Arg Asn Ala Lys Lys Ala Glu Ser Gly Lys Ser Ala Ser Thr Glu 

645 650 655 

Val Pro Gly Ala Ser Glu Asp Ala Glu Lys Asn Gin Lys He Met Gin 

660 665 670 

Trp He He Glu Gly Glu Lys Glu He Ser Arg His Arg Arg Thr Gly 

675 680 685 

His Gly Ser Ser Gly Thr Arg Lys Pro Gin Pro His Glu Asn Ser Arg 

690 695 700 

Pro Leu Ser Leu Glu His Pro Trp Ala Gly Pro Gin Leu Arg Thr Ser 
705 710 715 720 

Val Gin Pro Ser His Leu Phe He Gin Asp Pro Thr Met Pro Pro His 

725 730 735 

Pro Ala Pro Asn Pro Leu Thr Gin Leu Glu Glu Ala Arg Arg Arg Leu 

740 745 750 

Glu Glu Glu Glu Lys Arg Ala Ser Arg Ala Pro Ser Lys Gin Arg Tyr 
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755 








760 




Val 


Gin 


Glu 


Val 


Met 


Arg Arg Gly 


Arg 




770 










775 




Ala 


Pro 


Val 


Leu 


His 


Val 


Val Pro 


Ala 


785 










790 






Glu 


Thr 


Glu 


Thr 


Arg 


Ser 


Gin Arg 


Lys 










805 








Pro 


Cys 


Asp 


Ser 


He 


Val 


Val Ala 


Tyr 








820 








825 


Pro 


Tyr 


Arg 


Thr 


Leu 


Val 


Arg Gly 


Arg 






835 








840 




Lys 


Glu 


Leu 


Leu 


Thr 


Lys 


Lys Gly 


Ser 




850 










855 




Val 


Ser 


Asp 


Glu 


Phe 


Asp 


Cys Gly 


Val 


865 










870 






Asp 


Glu 


Ala 


Val 


Leu 


Pro 


Val Phe 


Glu 










885 








Glu 


Lys 


Val 


Asp 











900 



<210> 671 
<211> 591 
<212> PRT 

<213> Homo sapiens 



<400> 671 



Met 
1 


Val 


Pro 


Val 


Ala 


Val 


Thr 


Ala 


Ala 


Asn 


Ser 


Asp 


Phe 


5 

Ser 


Asp 


Leu Arg 


Glu 








20 










25 


He 


Ala 


Gly Leu 


Thr 


Arg 


Glu 


Arg 


Gly 






35 










40 




Ser 


Ala 


Glu 


Leu 


Ala 


Phe 


Ser 


Leu 


Pro 




50 










55 






Gin 


Pro 


Pro 


Pro 


Pro 


He 


Thr 


Glu 


Glu 


65 










70 








Tyr 


Thr 


Leu 


Ala 


Lys 


Ala 


Tyr 


Phe 


Asp 










85 










Ala 


His 


Phe 


Leu 


His 


Gly Cys 


Asn 


Ser 








100 










105 


Met 


Tyr 


Ser 


Arg 


Tyr 


Leu 


Ser Gly 


Glu 






115 










120 




Val 


Asp 


Ser 


Leu Gly 


Pro 


Leu 


Glu 


Lys 




130 










135 






Leu 


Arg 


Glu 


Leu Arg 


Val 


Glu 


Leu 


Ser 


145 










150 








Leu 


Asp 


Gly Phe Gly 


Leu 


Tyr 


Leu 


Tyr 










165 










Asp 


Leu 


Val 


Lys 


Glu 


Ala 


He 


Asp 


Val 








180 










185 


Leu 


Pro 


Leu 


His 


Trp 


Gly Ala Trp 


Leu 






195 










200 




Asp 


Lys 


Glu 


Met 


Leu 


Lys 


Phe 


Leu 


Ser 




210 










215 
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765 



Ala 


Cys 


Val 


Arg 


Pro 


Ala 


Cys 






780 










Val 


Ser 


Asp 


Met 


Glu 


Leu 


Ser 




795 










800 


Val 


Gly Gly Gly Ser 


Ala 


Gin 


810 










815 




Tyr 


Phe 


Cys 


Gly Glu 


Pro 


He 










830 






Ala 


Val 


Thr Leu Gly 


Gin 


Phe 








845 








Tyr 


Arg 


Tyr 


Tyr 


Phe 


Lys 


Lys 






860 










Val 


Phe 


Glu 


Glu 


Val 




Glu 




875 










880 


Glu 


Lys 


He 


He 


Gly 


Lys 


Val 


890 










895 




Val 


Ala 


Pro 


Val 


Leu 


Ser 


He 


10 










15 




He 


Lys 


Lys 


Gin 


Leu 


Leu 


Leu 










30 






Leu 


Leu 


His 


Ser 


Ser 


Lys 


Trp 








45 








Ala 


Leu 


Pro 


Leu 


Ala 


Glu 


Leu 






60 










Asp 


Ala 


Gin Asp 


Met 


Asp 


Ala 




75 










80 


Val 


Lys 


Glu 


Tyr 


Asp 


Arg 


Ala 


90 










95 




Lys 


Lys 


Ala 


Tyr 


Phe 


Leu 


Tyr 










110 






Lys 


Lys 


Lys 


Asp 


Asp 


Glu 


Thr 








12 5 








Gly 


Gin 


Val 


Lys 


Asn 


Glu 


Ala 






140 










Lys 


Lys 


His 


Gin 


Ala 


Arg 


Glu 




155 










160 


Gly 


Val 


Val 


Leu Arg 


Lys 


Leu 


170 










175 




Phe 


Val 


Glu 


Ala 


Thr 


His 


Val 










190 






Glu 


Leu 


Cys 


Asn 


Leu 


He 


Thr 








205 








Leu 


Pro 


Asp 


Thr 


Trp 


Met 


Lys 



220 
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Glu Phe Phe Leu Ala His lie Tyr Thr Glu Leu Gin Leu lie Glu Glu 
225 230 235 240 

Ala Leu Gin Lys Tyr Gin Asn Leu lie Asp Val Gly Phe Ser Lys Ser 

245 250 255 

Ser Tyr He Val Ser Gin He Ala Val Ala Tyr His Asn He Arg Asp 

260 265 270 

He Asp Lys Ala Leu Ser He Phe Asn Glu Leu Arg Lys Gin Asp Pro 

275 280 285 

Tyr Arg He Glu Asn Met Asp Thr Phe Ser Asn Leu Leu Tyr Val Arg 

290 295 300 

Ser Met Lys Ser Glu Leu Ser Tyr Leu Ala His Asn Leu Cys Glu lie 
305 " 310 315 320 

Asp Lys Tyr Arg Val Glu Thr Cys Cys Val lie Gly Asn Tyr Tyr Ser 

325 330 335 

Leu Arg Ser Gin His Glu Lys Ala Ala Leu Tyr Phe Gin Arg Ala Leu 

340 345 350 

Lys Leu Asn Pro Arg Tyr Leu Gly Ala Trp Thr Leu Met Gly His Glu 

355 360 365 

Tyr Met Glu Met Lys Asn Thr Ser Ala Ala He Gin Ala Tyr Arg His 

370 375 380 

Ala He Glu Val Asn Lys Arg Asp Tyr Arg Ala Trp Tyr Gly Leu Gly 
385 390 395 400 

Gin Thr Tyr Glu He Leu Lys Met Pro Phe Tyr Cys Leu Tyr Tyr Tyr 

405 410 415 

Arg Arg Ala His Gin Leu Arg Pro Asn Asp Ser Arg Met Leu Val Ala 

420 425 430 

Leu Gly Glu Cys Tyr Glu Lys Leu Asn Gin Leu Val Glu Ala Lys Lys 

435 440 445 

Cys Tyr Trp Arg Ala Tyr Ala Val Gly Asp Val Glu Lys Met Ala Leu 

450 455 460 

Val Lys Leu Ala Lys Leu His Glu Gin Leu Thr Glu Ser Glu Gin Ala 
465 470 475 480 

Ala Gin Cys Tyr lie Lys Tyr He Gin Asp lie Tyr Ser Cys Gly Glu 

485 490 495 

He Val Glu His Leu Glu Glu Ser Thr Ala Phe Arg Tyr Leu Ala Gin 

500 505 510 

Tyr Tyr Phe Lys Cys Lys Leu Trp Asp Glu Ala Ser Thr Cys Ala Gin 

515 520 525 

Lys Cys Cys Ala Phe Asn Asp Thr Arg Glu Glu Gly Lys Ala Leu Leu 

530 535 540 

Arg Gin He Leu Gin Leu Arg Asn Gin Gly Glu Thr Pro Thr Thr Glu 
545 550 555 560 

Val Pro Ala Pro Phe Phe Leu Pro Ala Ser Leu Ser Ala Asn Asn Thr 

565 570 575 

Pro Thr Arg Arg Val Ser Pro Leu Asn Leu Ser Ser Val Thr Pro 
580 585 590 

<210> 672 
<211> 914 
<212> PRT 

<213> Homo sapiens 
<400> 672 

Val Tyr Gin Val Leu Leu Val Gly Ser Thr Leu Leu Lys Glu Val Pro 
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1 



5 



10 



15 



Ser Gly Leu Gin Leu Glu Gin Leu Pro Ser Gin Ser Leu Leu Thr His 

20 25 30 

lie Pro Thr Ala Gly Leu Pro Thr Ser Leu Gly Gly Gly Leu Pro Tyr 

35 40 45 

Cys His Gin Ala Trp Leu Asp Phe Arg Arg Arg Leu Glu Ala Leu Leu 

50 55 60 

Gin Asn Cys Gin Ala Ala Cys Ala Leu Leu Gin Gly Ala lie Glu Ser 
65 70 75 80 

Val Lys Ala Val Pro Gin Pro Met Glu Pro Gly Glu Val Gly Gin Leu 

85 90 95 

Leu Gin Gin Thr Glu Val Leu Met Gin Gin Val Leu Asp Ser Pro Trp 

100 105 110 

Leu Ala Trp Leu Gin Cys Gin Gly Gly Arg Glu Leu Thr Trp Leu Lys 

115 120 125 

Gin Glu Val Pro Glu Val Thr Leu Ser Pro Asp Tyr Arg Thr Ala Met 

13 0 135 14 0 

Asp Lys Ala Asp Glu Leu Tyr Asp Arg Val Asp Gly Leu Leu His Gin 
145 150 155 160 

Leu Thr Leu Gin Ser Asn Gin Arg lie Gin Ala Leu Glu Leu Val Gin 

165 170 175 

Thr Leu Glu Ala Arg Glu Ser Gly Leu His Gin lie Glu Val Trp Leu 

180 185 190 

Gin Gin Val Gly Trp Pro Ala Leu Glu Glu Ala Gly Glu Pro Ser Leu 

195 200 205 

Asp Met Leu Leu Gin Ala Gin Gly Ser Phe Gin Glu Leu Tyr Gin Val 

210 215 220 

Ala Gin Glu Gin Val Arg Gin Gly Glu Lys Phe Leu Gin Pro Leu Thr 
225 230 235 240 

Gly Trp Glu Ala Ala Glu Leu Asp Pro Pro Gly Ala Arg Phe Leu Ala 

245 250 255 

Leu Arg Ala Gin Leu Thr Glu Phe Ser Arg Ala Leu Ala Gin Arg Cys 

260 265 270 

Gin Arg Leu Ala Asp Ala Glu Arg Leu Phe Gin Leu Phe Arg Glu Ala 

275 280 285 

Leu Thr Trp Ala Glu Glu Gly Gin Arg Val Leu Ala Glu Leu Glu Gin 

290 295 300 

Glu Arg Pro Gly Val Val Leu Gin Gin Leu Gin Leu His Trp Thr Arg 
305 310 315 320 

His Pro Asp Leu Pro Pro Ala His Phe Arg Lys Met Trp Ala Leu Ala 

325 330 335 

Thr Gly Leu Gly Ser Glu Ala lie Arg Gin Glu Cys Arg Trp Ala Trp 

340 345 350 

Ala Arg Cys Gin Asp Thr Trp Leu Ala Leu Asp Gin Lys Leu Glu Ala 

355 360 365 

Ser Leu Lys Leu Pro Pro Val Gly Ser Thr Ala Ser Leu Cys Val Ser 

370 375 380 

Gin Val Pro Ala Ala Pro Ala His Pro Pro Leu Arg Lys Ala Tyr Ser 
385 390 395 400 

Phe Asp Arg Asn Leu Gly Gin Ser Leu Ser Glu Pro Ala Cys His Cys 

405 410 415 

His His Ala Ala Thr lie Ala Ala Cys Arg Arg Pro Glu Ala Gly Gly 

420 425 430 

Gly Ala Leu Pro Gin Ala Ser Pro Thr Val Pro Pro Pro Gly Ser Ser 
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435 440 445 

Asp Pro Arg Ser Leu Asn Arg Leu Gin Leu Val Leu Ala Glu Met Val 

450 455 460 

Ala Thr Glu Arg Glu Tyr Val Arg Ala Leu Glu Tyr Thr Met Glu Asn 
465 ^ 470 475 480 

Tyr Phe Pro Glu Leu Asp Arg Pro Asp Val Pro Gin Gly Leu Arg Gly 

485 490 495 

Gin Arg Ala His Leu Phe Gly Asn Leu Glu Lys Leu Arg Asp Phe His 

500 505 510 

Cys His Phe Phe Leu Arg Glu Leu Glu Ala Cys Thr Arg His Pro Pro 

515 520 525 

Arg Val Ala Tyr Ala Phe Leu Arg His Arg Val Gin Phe Gly Met Tyr 

530 535 540 

Ala Leu Tyr Ser Lys Asn Lys Pro Arg Ser Asp Ala Leu Met Ser Ser 
545 ~ ^ 550 555 560 

Tyr Gly His Thr Phe Phe Lys Asp Lys Gin Gin Ala Leu Gly Asp His 

565 570 575 

Leu Asp Leu Ala Ser Tyr Leu Leu Lys Pro lie Gin Arg Met Gly Lys 

580 585 590 

Tyr Ala Leu Leu Leu Gin Glu Leu Ala Arg Ala Cys Gly Gly Pro Thr 

595 600 605 

Gin Glu Leu Ser Ala Leu Arg Glu Ala Gin Ser Leu Val His Phe Gin 

610 615 620 

Leu Arg His Gly Asn Asp Leu Leu Ala Met Asp Ala lie Gin Gly Cys 
625 ~ ' 630 635 640 

Asp Val Asn Leu Lys Glu Gin Gly Gin Leu Val Arg Gin Asp Glu Phe 

645 650 655 

Val Val Arg Thr Gly Arg His Lys Ser Val Arg Arg He Phe Leu Phe 

660 665 670 

Glu Glu Leu Leu Leu Phe Ser Lys Pro Arg His Gly Pro Thr Gly Val 

675 680 685 

Asp Thr Phe Ala Tyr Lys Arg Ser Phe Lys Met Ala Asp Leu Gly Leu 

690 695 700 

Thr Glu Cys Cys Gly Asn Ser Asn Leu Arg Phe Glu lie Trp Phe Arg 
705 710 715 720 

Arg Arg Lys Ala Arg Asp Thr Phe Val Leu Gin Ala Ser Ser Leu Ala 

725 730 735 

He Lys Gin Ala Trp Thr Ala Asp He Ser His Leu Leu Trp Arg Gin 

740 745 750 

Ala Val His Asn Lys Glu Val Arg Met Ala Glu Met Val Ser Met Gly 

755 760 765 

Val Gly Asn Lys Ala Phe Arg Asp He Ala Pro Ser Glu Glu Ala He 

770 775 780 

Asn Asp Arg Thr Val Asn Tyr Val Leu Lys Cys Arg Glu Val Arg Ser 
785 790 795 800 

Arg Ala Ser He Ala Val Ala Pro Phe Asp His Asp Ser Leu Tyr Leu 

805 810 815 

Gly Ala Ser Asn Ser Leu Pro Gly Asp Pro Ala Ser Cys Ser Val Leu 

820 825 830 

Gly Ser Leu Asn Leu His Leu Tyr Arg Asp Pro Ala Leu Leu Gly Leu 

835 840 845 

Arg Cys Pro Leu Tyr Pro Ser Phe Leu Glu Glu Ala Ala Leu Glu Ala 

850 855 860 

Glu Ala Glu Leu Gly Gly Gin Pro Ser Leu Thr Ala Glu Asp Ser Glu 
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865 870 
lie Ser Ser Gin Cys Pro Ser Ala 
885 

Ser Cys Val Ser Gly Gin Ala Leu 
900 

Cys Val 



875 880 
Ser Gly Ser Ser Gly Ser Asp Ser 

890 895 
Gly Arg Gly Leu Glu Asp Leu Pro 
905 910 



<210> 673 
<211> 277 
<212> PRT 
<213> Homo sapiens 



<400> 673 



Leu 


Asn 


Tyr 


Leu Leu Glu Ser 


Arg 


Leu 


Glu 


Ala 


Ala 


Ala 


His 


Cys 


Ala 


1 






5 






10 










15 




Leu 


Lys 


Gin Gly lie Ala Thr 


Ala 


Ser 


Leu 


Leu 


Pro 


Ala 


Gin 


Leu 


Gin 








20 




25 










30 






Pro 


Ala 


Val 


Leu Thr Val Val 


Thr 


Cys 


His 


Val 


Val 


Val 


Ser 


Val 


His 






35 




40 










45 








Gly His 


His 


Thr Asp Gly Cys 


Leu 


Ala 


Ala 


Leu 


Cys 


Arg 


Glu Asp 


Arg 




50 




55 










60 










Thr 


Gly 


Thr Gly Gly Ala Phe 


Trp 


Cys 


Lys 


Asn Arg 


Val 


lie 


Val 


Ser 


65 






70 








75 










80 


His 


Ala 


Val 


Asp Val Val Leu 


His 


Val 


His 


Gly Glu 


Gly 


Asn 


Pro 


Val 








85 






90 










95 




Gin 


Ala 


Leu 


lie Ala His Gly Ala 


Pro 


Glu 


Ala 


Ala 


Trp 


Val 


Val 


Gly 








100 




105 










110 






Leu 


Ala 


Gin Gly Leu Gin Asp 


His 


Phe 


His 


Asp 


Glu 


Met 


Ser 


Thr 


His 






115 




120 










125 








Ala 


Ala 


Phe 


Val Gly Arg Leu 


Leu 


Glu Pro Gly Val 


Gin 


Glu 


Val 


Leu 




130 




135 










140 










Leu 


Ala 


Val 


His Phe Leu Thr 


His 


Val 


Val 


Glu 


Arg 


Leu 


Pro 


Thr 


Glu 


145 






150 








155 










160 


Ser 


Ser 


Pro Thr Arg Val Ala 


Gly 


Glu 


Ala 


Val 


Ser 


Val 


He 


Lys 


Thr 








1.65 






170 










175 




Pro 


His 


Cys 


Leu Ala Arg Leu 


Leu 


Gly 


Ser 


Val 


Asp 


Ala 


Lys 


Pro 


Thr 








180 




185 










190 






Leu 


Asp 


Ala 


Asn Ala Glu Val 


Val 


Pro 


Arg Arg 


Ala 


Arg 


Leu 


Glu 


Arg 






195 




200 










205 








Pro 


Leu 


Gin 


Leu Pro Gly Glu Arg 


Leu 


Gin 


Pro 


Pro 


Leu 


Gly Arg 


Ala 




210 




215 










220 










Trp 


Ala 


Ala 


Leu Pro Ala Arg 


Gly 


Gin 


Arg 


Glu 


Cys 


Arg 


Gin Arg 


Glu 


225 






230 








235 










240 


Gly Gly 


Arg 


Pro Arg Arg Leu 


Arg 


Gly Ala 


Ser Gly Arg 


Gly Ala 


Gly 








245 






250 










255 




Ala 


Gly 


Arg 


Glu Glu Val Ser 


Val 


Gly Phe 


Ser 


Ala 


Gin 


Trp 


Glu 


Phe 








260 




265 










270 







Gly Ser Gly Arg His 
275 



<210> 674 
<211> 1120 
<212> PRT 
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<213> Homo sapiens 
<400> 674 

Met Trp Arg Val Lys Lys Leu Ser Leu Ser Leu Ser Pro Ser Pro Gin 

X ~ ~ 5 10 15 

Thr Gly Lys Pro Ser Met Arg Thr Pro Leu Arg Glu Leu Thr Leu Gin 

20 25 30 

Pro Gly Ala Leu Thr Thr Ser Gly Lys Arg Ser Pro Ala Cys Ser Ser 

35 40 45 

Leu Thr Pro Ser Leu Cys Lys Leu Gly Leu Gin Glu Gly Ser Asn Asn 

50 55 60 

Ser Ser Pro Val Asp Phe Val Asn Asn Lys Arg Thr Asp Leu Ser Ser 
65 70 75 80 

Glu His Phe Ser His Ser Ser Lys Trp Leu Glu Thr Cys Gin His Glu 

85 90 95 

Ser Asp Glu Gin Pro Leu Asp Pro lie Pro Gin lie Ser Ser Thr Pro 

100 105 110 

Lys Thr Ser Glu Glu Ala Val Asp Pro Leu Gly Asn Tyr Met Val Lys 

115 120 12 5 

Thr lie Val Leu Val Pro Ser Pro Leu Gly Gin Gin Gin Asp Met lie 

130 135 140 

Phe Glu Ala Arg Leu Asp Thr Met Ala Glu Thr Asn Ser He Ser Leu 
145 150 155 160 

Asn Gly Pro Leu Arg Thr Asp Asp Leu Val Arg Glu Glu Val Ala Pro 

165 170 175 

Cys Met Gly Asp Arg Phe Ser Glu Val Ala Ala Val Ser Glu Lys Pro 

180 185 190 

He Phe Gin Glu Ser Pro Ser His Leu Leu Glu Glu Ser Pro Pro Asn 

195 200 205 

Pro Cys Ser Glu Gin Leu His Cys Ser Lys Glu Ser Leu Ser Ser Arg 

210 215 220 

Thr Glu Ala Val Arg Glu Asp Leu Val Pro Ser Glu Ser Asn Ala Phe 
225 230 235 240 

Leu Pro Ser Ser Val Leu Trp Leu Ser Pro Ser Thr Ala Leu Ala Ala 

245 250 255 

Asp Phe Arg Val Asn His Val Asp Pro Glu Glu Glu He Val Glu His 

260 265 270 

Gly Ala Met Glu Glu Arg Glu Met Arg Phe Pro Thr His Pro Lys Glu 

275 280 285 

Ser Glu Thr Glu Asp Gin Ala Leu Val Ser Ser Val Glu Asp He Leu 

290 295 300 

Ser Thr Cys Leu Thr Pro Asn Leu Val Glu Met Glu Ser Gin Glu Ala 
305 310 315 320 

Pro Gly Pro Ala Val Glu Asp Val Gly Arg He Leu Gly Ser Asp Thr 

325 330 335 

Glu Ser Trp Met Ser Pro Leu Ala Trp Leu Glu Lys Gly Val Asn Thr 

340 345 350 

Ser Val Met Leu Glu Asn Leu Arg Gin Ser Leu Ser Leu Pro Ser Met 

355 360 365 

Leu Arg Asp Ala Ala He Gly Thr Thr Pro Phe Ser Thr Cys Ser Val 

370 375 380 

Gly Thr Trp Phe Thr Pro Ser Ala Pro Gin Glu Lys Ser Thr Asn Thr 
385 390 395 400 

Ser Gin Thr Gly Leu Val Gly Thr Lys His Ser Thr Ser Glu Thr Glu 



-231- 



BNSDOCID: <WO. 



02092000A2_I_> 




WO 02/092000 PCT/US02/14877 









405 










410 










415 




Gin 


Leu 


Leu Cys 


Gly Arg 


Pro 


Pro 


Asp 


Leu 


Thr 


Ala 


Leu 


Ser 


Arg 


His 






420 










425 










430 






Asp 


Leu 


Glu Asp 


Asn 


Leu 


Leu 


Ser 


Ser 


Leu 


Val 


He 


Val 


Glu 


Phe 


Leu 






435 








440 










445 








Ser Arg 


Gin Leu 


Arg Asp 


Trp 


Lys 


Ser 


Gin 


Leu 


Ala 


Val 


Pro 


His 


Pro 




450 








455 










460 










Glu 


Thr 


Gin Asp 


Ser 


Ser 


Thr Gin Thr Asp 


Thr 


Ser 


His 


Ser 


Gly 


He 


465 








470 










475 










480 


Thr 


Asn 


Lys Leu 


Gin 


His 


Leu 


Lys 


Glu 


Ser 


His 


Glu 


Met 


Gly 


Gin 


Ala 








485 










490 










495 




Leu 


Gin 


Gin Ala 


Arg 


Asn 


Val 


Met 


Gin 


Ser 


Trp 


Val 


Leu 


He 


Ser 


Lys 






500 










505 










510 






Glu 


Leu 


lie Ser 


Leu 


Leu 


His 


Leu 


Ser 


Leu 


Leu 


His 


Leu 


Glu 


Glu 


Asp 






515 








520 










525 








Lys 


Thr 


Thr Val 


Asn 


Gin 


Glu 


Ser 


Arg Arg 


Ala 


Glu 


Thr 


Leu 


Val 


Cys 




530 








535 










540 










Cys 


Cys 


Phe Asp 


Leu 


Leu 


Lys 


Lys 


Leu Arg 


Ala 


Lys 


Leu 


Gin 


Ser 


Leu 


545 








550 










555 










560 


Lys 


Ala 


Glu Arg 


Glu Glu Ala Arg 


His 


Arg 


Glu 


Glu 


Met 


Ala 


Leu 


Arg 








565 










570 










575 




Gly Lys 


Asp Ala 


Ala 


Glu 


He 


Val 


Leu 


Glu 


Ala 


Phe 


Cys 


Ala 


His 


Ala 






580 










585 










590 






Ser 


Gin 


Arg lie 


Ser Gin Leu Glu Gin Asp 


Leu 


Ala 


Ser 


Met 


Arg 


Glu 






595 








600 










605 








Phe 


Arg 


Gly Leu 


Leu 


Lys 


Asp 


Ala 


Gin 


Thr 


Gin 


Leu 


Val 


Gly 


Leu 


His 




610 








615 










620 










Ala 


Lys 


Gin Glu 


Glu 


Leu 


Val 


Gin 


Gin 


Thr 


Val 


Ser 


Leu 


Thr 


Ser 


Thr 


625 








630 










635 










640 


Leu 


Gin 


Gin Asp 


Trp Arg 


Ser 


Met 


Gin 


Leu 


Asp 


Tyr 


Thr 


Thr 


Trp 


Thr 








645 










650 










655 




Ala 


Leu 


Leu Ser 


Arg 


Ser 


Arg 


Gin 


Leu 


Thr 


Glu 


Lys 


Leu 


Thr 


Val 


Lys 






660 










665 










670 






Ser 


Gin 


Gin Ala 


Leu 


Gin 


Glu Arg 


Asp 


Val 


Ala 


lie 


Glu 


Glu 


Lys 


Gin 






675 








680 










685 








Glu 


Val 


Ser Arg 


Val 


Leu 


Glu 


Gin 


Val 


Ser 


Ala 


Gin 


Leu 


Glu 


Glu 


Cys 




690 








695 










700 










Lys 


Gly 


Gin Thr 


Glu 


Gin 


Leu 


Glu 


Leu 


Glu 


Asn 


He 


Arg 


Leu 


Ala 


Thr 


705 








710 










715 










720 


Asp 


Leu 


Arg Ala 


Gin 


Leu 


Gin 


He 


Leu 


Ala 


Asn 


Met 


Asp 


Ser 


Gin 


Leu 








725 










730 










735 




Lys 


Glu 


Leu Gin 


Ser 


Gin 


His 


Thr 


His 


Cys 


Ala 


Gin 


Asp 


Leu 


Ala 


Met 






740 










745 










750 






Lys 


Asp 


Glu Leu 


Leu 


Cys 


Gin 


Leu 


Thr 


Gin 


Ser 


Asn 


Glu 


Glu 


Gin 


Ala 






755 








760 










765 








Ala 


Gin 


Cys Val 


Lys 


Glu 


Glu 


Met 


Ala 


Leu 


Lys 


His 


Met 


Gin 


Ala 


Glu 




770 








775 










780 










Leu 


Gin 


Gin Gin 


Gin 


Ala 


Val 


Leu 


Ala 


Lys 


Glu 


Val 


Arg 


Asp 


Leu 


Lys 


785 








790 










795 










800 


Glu 


Thr 


Leu Glu 


Phe 


Ala 


Asp 


Gin 


Glu 


Asn 


Gin 


Val 


Ala 


His 


Leu 


Glu 








805 










810 










815 




Leu Gly 


Gin Val 


Glu 


Cys 


Gin 


Leu 


Lys 


Thr 


Thr 


Leu 


Glu 


Val 


Leu 


Arg 






820 










825 










830 






Glu 


Arg 


Ser Leu 


Gin 


Cys 


Glu 


Asn 


Leu 


Lys 


Asp 


Thr 


Val 


Glu 


Asn 


Leu 
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835 










840 










845 








Thr 


Ala 


Lys 


Leu 


Ala 


Ser 


Thr 


He 


Ala 


Asp 


Asn 


Gin 


Glu 


Gin Asp 


Leu 




850 










855 










860 










Glu 


Lys 


Thr 


Arg 


Gin 


Tyr 


Ser 


Gin 


Lys 


Leu 


Gly 


Leu 


Leu 


Thr 


Glu 


Gin 


865 










870 










875 










880 


Leu 


Gin 


Ser 


Leu 


Thr 


Leu 


Phe 


Leu 


Gin 


Thr 


Lys 


Leu 


Lys 


Glu 


Lys 


Thr 










885 










890 










895 




Glu 


Gin 


Glu 


Thr 


Leu 


Leu 


Leu 


Ser 


Thr 


Ala 


Cys 


Pro 


Pro 


Thr 


Gin 


Glu 








900 










905 










910 






His 


Pro 


Leu 


Pro 


Asn 


Asp 


Arg 


Thr 


Phe 


Leu 


Gly 


Ser 


He 


Leu 


Thr 


Ala 






915 










920 










925 








Val 


Ala 


Asp 


Glu 


Glu 


Pro 


Glu 


Ser 


Thr 


Pro 


Val 


Pro 


Leu 


Leu Gly 


Ser 




930 










935 










94 0 










Asp 


Lys 


Ser 


Ala 


Phe 


Thr 


Arg 


Val 


Ala 


Ser 


Met 


Val 


Ser 


Leu 


Gin 


Pro 


945 








950 










955 










960 


Ala 


Glu 


Thr 


Pro 


Gly 


Met 


Glu 


Glu 


Ser 


Leu 


Ala 


Glu 


Met 


Ser 


He 


Met 










965 










970 










975 




Thr 


Thr 


Glu 


Leu 


Gin 


Ser 


Leu 


Cys 


Ser 


Leu 


Leu 


Gin 


Glu 


Ser 


Lys 


Glu 








980 










985 










990 






Glu 


Ala 


He 


Arg 


Thr 


Leu 


Gin 


Arg 


Lys 


He 


Cys 


Glu 


Leu 


Gin 


Ala 


Arg 






995 










1000 








1005 






Leu 


Gin 


Ala 


Gin 


Glu 


Glu 


Gin 


His 


Gin 


Glu 


Val 


Gin 


Lys 


Ala 


Lys 


Glu 




1010 








1015 








1020 








Ala 


Asp 


He 


Glu 


Lys 


Leu 


Asn 


Gin 


Ala 


Leu 


Cys 


Leu 


Arg 


Tyr 


Lys 


Asn 


1021 








1030 








1035 








104i 


Glu 


Lys 


Glu 


Leu 


Gin 


Glu 


Val 


He 


Gin 


Gin 


Asn 


Glu 


Lys 


He 


Leu 


Glu 



1045 1050 1055 



Gin He Asp Lys Ser Gly Glu Leu He Ser Leu Arg Glu Glu Val Thr 

1060 1065 1070 

His Leu Thr Arg Ser Leu Arg Arg Ala Glu Thr Glu Thr Lys Val Leu 

1075 1080 1085 

Gin Glu Ala Trp Gin Ala Ser Trp Thr Pro Thr Ala Ser Leu Trp Pro 

1090 1095 1100 

Pro He Gly Ser Arg Arg Lys Cys Gly Ser Leu Arg Arg Trp Thr Asn 
1105 ~ *" 1110 H15 H20 



<210> 675 
<211> 540 
<212> PRT 

<213> Homo sapiens 



<400> 675 






















Met 


Gly Thr 


Thr 


Ala Arg Ala Ala 


Leu 


Val 


Leu 


Thr 


Tyr 


Leu 


Ala 


Val 


1 




5 




10 










15 




Ala 


Ser Ala 


Ala 


Ser Glu Gly Gly Phe Thr 


Ala 


Thr Gly Gin Arg 


Gin 






20 




25 










30 






Leu 


Arg Pro 


Glu 


His Phe Gin Glu 


Val 


Gly 


Tyr 


Ala 


Ala 


Pro 


Pro 


Ser 




35 




40 










45 








Pro 


Pro Leu 


Ser 


Arg Ser Leu Pro 


Met 


Asp 


His 


Pro 


Asp 


Ser 


Ser 


Gin 




50 




55 








60 










His 


Gly Pro 


Pro 


Phe Glu Gly Gin 


Ser 


Gin 


Val 


Gin 


Pro 


Pro 


Pro 


Ser 


65 




70 






75 










80 


Gin 


Glu Ala 


Thr 


Pro Leu Gin Gin 


Glu 


Lys 


Leu 


Leu 


Pro 


Ala 


Gin 


Leu 








85 




90 










95 
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Pro 


Ala 


Glu 


Lys 


Glu 


Val 


Gly Pro 


Pro 








100 








105 


Leu 


Gin 


Lys 


Glu 


Leu 


Pro 


Ser Leu 


Gin 






115 








120 




Gly 


Thr 


Pro 


Ala 


Pro 


Phe 


Gly Asp 


Gin 




130 










135 




Trp 


Asn 


Ala 


Ala 


Gin 


His 


Cys Gin 


Gin 


145 










150 






Gly His 


Arg 


Leu 


Asp 


Gly Phe Pro 


Pro 










165 








Leu 


Asn 


Gin 


He 


Cys 


Leu 


Pro Asn 


Arg 








180 








185 


Trp Asn 


Leu 


Pro 


Gin 


Ser 


Ser Tyr 


Ser 






195 








200 




Thr 


Leu 


Asn 


Phe 


Leu 


Glu 


He Gly 


Tyr 




210 










215 




Ser 


His 


Thr 


Asn 


Arg 


Leu 


Glu Cys 


Ala 


225 










230 






Met 


Ser 


Arg 


Phe 


Cys 


Glu 


Ala Glu 


Phe 










245 








Trp 


Cys 


Cys 


Thr Arg Gin Gly Glu Ala 








260 








265 


Glu 


Ala 


Pro 


Gin 


Pro 


His 


Tyr Gin 


Leu 






275 








280 




Pro Asp 


lie 


Ser 


Ser 


Gly Leu Glu 


Leu 




290 










295 




Thr 


Leu 


Asp 


Asn 


He 


Lys 


Asn He 


Cys 


305 










310 






Val 


Pro 


Arg 


Asn 


Leu 


Pro 


Ala Thr 


Asp 










325 








Ala 


Leu 


lie 


Gin 


Leu Glu Arg Glu 


Phe 








340 








345 


Asn 


Asn 


His 


Thr 


Cys 


Thr 


Trp Lys 


Ala 






355 








360 




Tyr 


Cys 


Asp 


Arg 


Glu 


Tyr 


Ala Val 


Lys 




370 










375 




Arg His 


Pro 


Pro 


Ser 


Pro 


Thr Arg 


Asp 


385 










390 






Pro 


Tyr 


Pro 


Asn 


Tyr Asp Arg Asp 


He 










405 








Val 


Thr 


Pro 


Asn 


Leu 


Met 


Gly His 


Leu 








420 








425 


Thr 


Lys 


His 


Lys 


His 


He 


Pro Gly Leu 






435 








440 




Cys 


Cys 


Asp 


Leu 


Pro 


Phe 


Pro Glu 


Gin 




450 










455 




Lys 


Leu 


Thr 


Phe 


He 


Asn 


Asp Leu 


Cys 


465 










470 






Arg Asp 


Pro 


Ala 


Leu 


Cys 


Cys Tyr 


Leu 










485 








Asn 


Cys 


Phe 


Asn 


He 


Asn 


Tyr Leu Arg 








500 








505 


Asp 


Thr 


Glu 


Asn 


Ala 


Lys 


Gly Gin Gly 






515 








520 
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Leu 


Pro 


Gin 


Glu 


Ala 


Val 


Pro 










110 






His 


Pro 


Asn 


Glu 


Gin 


Lys 


Glu 








125 








Ser 


His 


Pro 


Glu 


Pro 


Glu 


Ser 






140 










Asp 


Arg 


Ser 


Gin 


Gly 


Gly 


Trp 




155 










160 


Gly 


Arg 


Pro 


Ser 


Pro 


Asp 


Asn 


170 










175 




Gin 


His 


Val 


Val 


Tyr 


Gly 


Pro 










190 






His 


Leu 


Thr Arg 


Gin 


Gly 


Glu 








205 








Ser 


Arg 


Cys 


Cys 


His 


Cys 


Arg 






220 










Lys 


Leu 


Val 


Trp 


Glu 


Glu 


Ala 




235 










240 


Ser 


Val 


Lys 


Thr 


Arg 


Pro 


His 


250 










255 




Arg 


Phe 


Ser 


Cys 


Phe 


Gin 


Glu 










270 






Arg 


Ala 


Cys 


Pro 


Ser 


His 


Gin 








285 








Pro 


Phe 


Pro 


Pro 


Gly 


Val 


Pro 






300 










His 


Leu 


Arg 


Arg 


Phe 


Arg 


Ser 




315 










320 


Pro 


Leu 


Gin 


Arg 


Glu 


Leu 


Leu 


330 










335 




Gin 


Arg 


Cys 


Cys 


Arg 


Gin 


Gly 










350 






Trp 


Glu 


Asp 


Thr 


Leu 


Asp 


Lys 








365 








Thr 


His 


His 


His 


Leu 


Cys 


Cys 






380 










Glu 


Cys 


Phe 


Ala 


Arg 


Arg 


Ala 




395 










400 


Leu 


Thr 


He 


Asp 


He 


Ser 


Arg 


410 










415 




Cys 


Gly 


Asn 


Gin 


Arg 


Val 


Leu 










430 






He 


His 


Asn 


Met 


Thr 


Ala 


Arg 








445 








Ala 


Cys 


Cys 


Ala 


Glu 


Glu 


Glu 






460 










Gly 


Pro 


Arg 


Arg 


Asn 


He 


Trp 




475 










480 


Ser 


Pro 


Gly Asp 


Glu 


Gin 


Val 


490 










495 




Asn 


Val 


Ala 


Leu 


Val 


Ser 


Gly 










510 






Glu 


Gin 


Gly 


Ser 


Thr 


Gly 


Gly 



525 



-234- 



BNSDOCID: <WO 02092000 A2_l_> 



WO 02/092000 



PCT/US02/14877 



Thr Asn lie Ser Ser Thr Ser Glu Pro Lys Glu Glu 
530 535 540 

<210> 676 
<211> 462 
<212> PRT 
<213> Homo sapiens 



<400> 676 



Met 


Gly 


Lys Glu Lys Thr His 


He 


Asn 


He 


Val 


Val 


He 


Gly His 


Val 


1 




5 






10 










15 




Asp 


Ser 


Gly Lys Ser Thr Thr 


Thr 


Gly 


His 


Leu 


He 


Tyr 


Lys 


Cys 


Gly 






20 




25 










30 






Gly 


He 


Asp Lys Arg Thr He 


Glu 


Lys 


Phe 


Glu 


Lys 


Glu 


Ala 


Ala 


Glu 






35 


40 










45 








Met 


Gly 


Lys Gly Ser Phe Lys 


Tyr 


Ala 


Trp 


Val 


Leu 


Asp 


Lys 


Leu 


Lys 




50 


55 










60 










Ala 


Glu 


Arg Glu Arg Gly He 


Thr 


He 


Asp 


He 


Ser 


Leu 


Trp 


Lys 


Phe 


65 




70 








75 










80 


Glu 


Thr 


Ser Lys Tyr Tyr Val 


Thr 


He 


He 


Asp 


Ala 


Pro 


Gly His 


Arg 






85 






90 










95 




Asp 


Phe 


He Lys Asn Met He 


Thr 


Gly 


Thr 


Ser 


Gin 


Ala 


Asp 


Cys 


Ala 






100 




105 










110 






Val 


Leu 


He Val Ala Ala Gly 


Val 


Gly Glu 


Phe 


Glu 


Ala 


Gly 


He 


Ser 






115 


120 










125 








Lys 


As ii 


Gly Gin Thr Arg Glu 


His 


Ala 


Leu 


Leu 


Ala 


Tyr 


Thr 


Leu 


Gly 




130 


135 










140 










Val 


Lys 


Gin Leu He Val Gly 


Val 


Asn 


Lys 


Met 


Asp 


Ser 


Thr 


Glu 


Pro 


145 




150 








155 










160 


Pro 


Tyr 


Ser Gin Lys Arg Tyr 


Glu 


Glu 


He 


Val 


Lys 


Glu 


Val 


Ser 


Thr 






165 






170 










175 




Tyr 


He 


Lys Lys He Gly Tyr 


Asn 


Pro 


Asp 


Thr 


Val 


Ala 


Phe 


Val 


Pro 






180 




185 










190 






He 


Ser 


Gly Trp Asn Gly Asp 


Asn 


Met 


Leu 


Glu 


Pro 


Ser 


Ala 


Asn 


Met 






195 


200 










205 








Pro 


Trp 


Phe Lys Gly Trp Lys 


Val 


Thr 


Arg 


Lys 


Asp 


Gly 


Asn 


Ala 


Ser 




210 


215 










220 










Gly 


Thr 


Thr Leu Leu Glu Ala 


Val 


Asp 


Cys 


He 


Leu 


Pro 


Pro 


Thr 


Arg 


225 




230 








235 










240 


Pro 


Thr 


Asp Lys Pro Leu Arg 


Leu 


Pro 


Leu 


Gin 


Asp 


Val 


Tyr 


Lys 


He 






245 






250 










255 




Gly 


Gly 


He Gly Thr Val Pro 


Val 


Gly Arg 


Val 


Glu 


Thr 


Gly Val 


Leu 






260 




265 










270 






Lys 


Pro 


Gly Met Val Val Thr 


Phe 


Ala 


Pro 


Val 


Asn 


Val 


Thr 


Thr 


Glu 






275 


280 










285 








Val 


Lys 


Ser Val Glu Met His 


His 


Glu 


Ala 


Leu 


Ser 


Glu 


Ala 


Leu 


Pro 




290 


295 










300 










Gly 


Asp 


Asn Val Gly Phe Asn 


Val 


Lys 


Asn 


Val 


Ser 


Val 


Lys 


Asp 


Val 


305 




310 








315 










320 


Arg 


Arg 


Gly Asn Val Ala Gly 


Asp 


Ser 


Lys 


Asn 


Asp 


Pro 


Pro 


Met 


Glu 






325 






330 










335 




Ala 


Ala 


Gly Phe Thr Ala Gin 


Val 


He 


He 


Leu 


Asn 


His 


Pro Gly 


Gin 






340 




345 










350 






He 


Ser 


Ala Gly Tyr Ala Pro 


Val 


Leu Asp 


Cys 


His 


Thr 


Ala 


His 


He 
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355 








360 




Ala 


Cys 


Lys 


Phe Ala 


Glu 


Leu 


Lys 


Glu 




370 








375 






Lys 


Lys 


Leu 


Glu Asp 


Gly 


Pro 


Lys 


Phe 


385 








390 








He 


Val 


Asp 


Met Val 


Pro 


Gly 


Lys 


Pro 








405 










Asp 


Tyr 


Pro 


Pro Leu 


Gly Arg 


Phe 


Ala 








420 








425 


Val 


Ala 


Val 


Gly Val 


He 


Lys 


Ala 


Val 






435 








440 




Gly 


Lys 


Val 


Thr Lys 


Ser 


Ala 


Gin 


Lys 




450 








455 







<210> 677 
<211> 2328 
<212> PRT 

<213> Homo sapiens 
<400> 677 



Lys 
1 


Ser 


Lys 


Arg 


Gin 
5 


Ala 


Gin 


Gin 


Met 


Ala 


Val 


Ser 


Gin 


Ser 


Lys 


Pro 


Gly 


Cys 








20 










25 


Gin 


He 


Asn 


Gin 


Gin 


Trp 


Glu 


Arg 


Thr 






35 










40 




Cys 


Thr 


Cys 


Tyr 


Gly 


Gly 


Ser 


Arg 


Gly 




50 










55 






Glu 


Ala 


Glu 


Glu 


Thr 


Cys 


Phe 


Asp 


Lys 


65 










70 








Val 


Gly 


Asp 


Thr 


Tyr 
85 


Glu Arg 


Pro 


Lys 


Thr 


Cys 


He 


Gly 


Ala 


Gly Arg 


Gly 


Arg 








100 










105 


Arg 


Cys 


His 


Glu 


Gly 


Gly Gin 


Ser 


Tyr 






115 










120 




Arg 


Pro 


His 


Glu 


Thr 


Gly Gly 


Tyr 


Met 




130 










135 






Asn 


Gly 


Lys 


Gly 


Glu 


Trp 


Thr 


Cys 


Lys 


145 










150 








Asp 


His 


Ala 


Ala 


Gly 


Thr 


Ser 


Tyr 


Val 










165 










Pro 


Tyr 


Gin 


Gly 


Trp 


Met 


Met 


Val 


Asp 








180 










185 


Ser 


Gly 


Arg 


He 


Thr 


Cys 


Thr 


Ser 


Arg 






195 










200 




Thr 


Arg 


Thr 


Ser 


Tyr 


Arg 


He 


Gly 


Asp 




210 










215 






Arg 


Gly 


Asn 


Leu 


Leu 


Gin 


Cys 


He 


Cys 


225 










230 








Trp 


Lys 


Cys 


Glu 


Arg 


His 


Thr 


Ser 


Val 










245 










Gly 


Pro 


Phe 


Thr 


Asp 


Val 


Arg 


Ala 


Ala 








260 










265 
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365 



Lys 


He 


Asp 


Arg 


Arg 


Ser 


Gly 






380 










Leu 


Lys 


Ser 


Gly Asp 


Ala 


Ala 




395 










400 


Met 


Cys 


Val 


VJTJL LI 




Phe 


Ser 


410 










415 




Val 


Arg 


Asp 


Met 


Arg 


Gin 


Thr 










430 






Asp 


Lys 


Lys 


Ala 


Ala 


Gly 


Ala 








445 








Ala 


Gin 


Lys 


Ala 


Lys 










460 










Val 


Gin 


Pro 


Gin 


Ser 


Pro 


Val 


10 










15 




Tyr 


Asp 


Asn Gly Lys 


His 


Tyr 










30 






Tyr 


Leu 


Gly Asn Val 


Leu 


Val 








45 








Phe 


Asn 


Cys 


Glu 


Ser 


Lys 


Pro 






60 










Tyr 


Thr 


Gly Asn Thr 


Tyr 


Arg 




75 










80 


Asp 


Ser 


Met 


He 


Trp 


Asp 


Cys 


90 










95 




He 


Ser 


Cys 


Thr 


He 


Ala 


Asn 










110 






Lys 


He 


Gly Asp Thr 


Trp 


Arg 








125 








Leu 


Glu 


Cys 


Val 


Cys 


Leu 


Gly 






14 0 










Pro 


He 


Ala 


Glu 


Lys 


Cys 


Phe 




155 










160 


Val 


Gly 


Glu 


Thr 


Trp 


Glu 


Lys 


>170 










175 




Cys 


Thr 


Cys 


Leu Gly 


Glu 


Gly 










190 






Asn 


Arg 


Cys 


Asn 


Asp 


Gin 


Asp 








205 








Thr 


Trp 


Ser 


Lys 


Lys 


Asp 


Asn 






220 










Thr 


Gly 


Asn 


Gly Arg 


Gly 


Glu 




235 










240 


Gin 


Thr 


Thr 


Ser 


Ser 


Gly 


Ser 


250 










255 




Val 


Tyr 


Gin 


Pro 


Gin 


Pro 


His 










270 
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Pro Gin Pro Pro Pro Tyr Gly His Cys Val Thr Asp Ser Gly Val Val 

275 280 285 

Tyr Ser Val Gly Met Gin Trp Leu Lys Thr Gin Gly Asn Lys Gin Met 

290 295 300 

Leu Cys Thr Cys Leu Gly Asn Gly Val Ser Cys Gin Glu Thr Ala Val 
305 " 310 315 320 

Thr Gin Thr Tyr Gly Gly Asn Leu Asn Gly Glu Pro Cys Val Leu Pro 

325 330 335 

Phe Thr Tyr Asn Gly Arg Thr Phe Tyr Ser Cys Thr Thr Glu Gly Arg 

340 345 350 

Gin Asp Gly His Leu Trp Cys Ser Thr Thr Ser Asn Tyr Glu Gin Asp 

355 360 365 

Gin Lys Tyr Ser Phe Cys Thr Asp His Thr Val Leu Val Gin Thr Gin 

370 375 380 

Gly Gly Asn Ser Asn Gly Ala Leu Cys His Phe Pro Phe Leu Tyr Asn 
385 " 390 395 400 

Asn His Asn Tyr Thr Asp Cys Thr Ser Glu Gly Arg Arg Asp Asn Met 

405 410 415 

Lys Trp Cys Gly Thr Thr Gin Asn Tyr Asp Ala Asp Gin Lys Phe Gly 

420 425 430 

Phe Cys Pro Met Ala Ala His Glu Glu He Cys Thr Thr Asn Glu Gly 

435 440 445 

Val Met Tyr Arg He Gly Asp Gin Trp Asp Lys Gin His Asp Met Gly 

450 455 460 

His Met Met Arg Cys Thr Cys Val Gly Asn Gly Arg Gly Glu Trp Thr 
465 ^ 470 475 480 

Cys He Ala Tyr Ser Gin Leu Arg Asp Gin Cys He Val Asp Asp He 

485 490 495 

Thr Tyr Asn Val Asn Asp Thr Phe His Lys Arg His Glu Glu Gly His 

500 505 510 

Met Leu Asn Cys Thr Cys Phe Gly Gin Gly Arg Gly Arg Trp Lys Cys 

515 520 525 

Asp Pro Val Asp Gin Cys Gin Asp Ser Glu Thr Gly Thr Phe Tyr Gin 

530 535 540 

He Gly Asp Ser Trp Glu Lys Tyr Val His Gly Val Arg Tyr Gin Cys 
545 550 555 560 

Tyr Cys Tyr Gly Arg Gly He Gly Glu Trp His Cys Gin Pro Leu Gin 

565 570 575 

Thr Tyr Pro Ser Ser Ser Gly Pro Val Glu Val Phe He Thr Glu Thr 

580 585 590 

Pro Ser Gin Pro Asn Ser His Pro He Gin Trp Asn Ala Pro Gin Pro 

595 600 605 

Ser His He Ser Lys Tyr He Leu Arg Trp Arg Pro Lys Asn Ser Val 

610 615 620 

Gly Arg Trp Lys Glu Ala Thr He Pro Gly His Leu Asn Ser Tyr Thr 
625 J ^ 630 635 640 

He Lys Gly Leu Lys Pro Gly Val Val Tyr Glu Gly Gin Leu He Ser 

645 650 655 

He Gin Gin Tyr Gly His Gin Glu Val Thr Arg Phe Asp Phe Thr Thr 

660 665 670 

Thr Ser Thr Ser Thr Pro Val Thr Ser Asn Thr Val Thr Gly Glu Thr 

675 680 685 

Thr Pro Phe Ser Pro Leu Val Ala Thr Ser Glu Ser Val Thr Glu He 
690 695 700 
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Thr 


Ala 


Ser 


Ser 


Phe 


Val 


Val 


Ser 


Trp 


Val 


Ser 


Ala 


Ser 


Asp 


Thr 


Val 


705 










710 










715 










720 


Ser 


Gly 


Phe 


Arg Val 


Glu 


Tyr 


Glu 


Leu 


Ser 


Glu 


Glu 


Gly 


Asp 


Glu 


Pro 










725 










730 










735 




Gin 


Tyr 


Leu 


Asp 


Leu 


Pro 


Ser 


Thr 


Ala 


Thr 


Ser 


Val 


Asn 


He 


Pro 


Asp 








740 










745 










750 






Leu 


Leu 


Pro 


Gly Arg 


Lys 


Tyr 


He 


Val 


Asn 


Val 


Tyr 


Gin 


He 


Ser 


Glu 






755 










760 










765 








Asp 


Gly 


Glu 


Gin 


Ser 


Leu 


He 


Leu 


Ser 


Thr 


Ser 


Gin 


Thr 


Thr 


Ala 


Pro 




770 










775 










780 










Asp 


Ala 


Pro 


Pro 


Asp 


Pro 


Thr 


Val 


Asp 


Gin 


Val 


Asp 


Asp 


Thr 


Ser 


He 


785 










790 










795 










800 


Val 


Val 


Arg 


Trp 


Ser 


Arg 


Pro 


Gin 


Ala 


Pro 


He 


Thr 


Gly 


Tyr 


Arg 


He 










805 










810 










815 




Val 


Tyr 


Ser 


Pro 


Ser 


Val 


Glu 


Gly 


Ser 


Ser 


Thr 


Glu 


Leu 


Asn 


Leu 


Pro 








820 










825 










830 






Glu 


Thr 


Ala 


Asn 


Ser 


Val 


Thr 


Leu 


Ser 


Asp 


Leu 


Gin 


Pro 


Gly 


Val 


Gin 






835 










840 










845 








Tyr 


Asn 


He 


Thr 


He 


Tyr Ala 


Val 


Glu 


Glu 


Asn 


Gin 


Glu 


Ser 


Thr 


Pro 




850 










855 










860 










Val 


Val 


He 


Gin 


Gin 


Glu 


Thr 


Thr 


Gly 


Thr 


Pro 


Arg 


Ser 


Asp 


Thr 


Val 


865 










870 










875 










880 


Pro 


Ser 


Pro 


Arg 


Asp 


Leu 


Gin 


Phe 


Val 


Glu 


Val 


Thr 


Asp 


Val 


Lys 


Val 










885 










890 










895 




Thr 


He 


Met 


Trp 


Thr 


Pro 


Pro 


Glu 


Ser 


Ala 


Val 


Thr 


Gly 


Tyr 


Arg 


Val 








900 










905 










910 






Asp 


Val 


He 


Pro 


Val 


Asn 


Leu 


Pro 


Gly 


Glu 


His 


Gly 


Gin 


Arg 


Leu 


Pro 






915 










920 










925 








lie 


Ser 


Arg 


Asn 


Thr 


Phe 


Ala 


Glu 


Val 


Thr 


Gly 


Leu 


Ser 


Pro 


Gly 


Val 




930 










935 










940 










Thr 


Tyr 


Tyr 


Phe 


Lys 


Val 


Phe 


Ala 


Val 


Ser 


His 


Gly 


Arg 


Glu 


Ser 


Lys 


945 










950 










955 










960 


Pro 


Leu 


Thr 


Ala 


Gin 


Gin 


Thr 


Thr 


Lys 


Leu 


Asp 


Ala 


Pro 


Thr 


Asn 


Leu 










965 










970 










975 




Gin 


Phe 


Val 


Asn 


Glu 


Thr Asp 


Ser 


Thr 


Val 


Leu 


Val 


Arg 


Trp 


Thr 


Pro 








980 










985 










990 






Pro 


Arg 


Ala 


Gin 


He 


Thr Gly 


Tyr 


Arg 


Leu 


Thr 


Val 


Gly 


Leu 


Thr 


Arg 






995 










1000 








1005 






Arg 


Gly 


Gin 


Pro Arg 


Gin 


Tyr 


Asn 


Val 


Gly 


Pro 


Ser 


Val 


Ser 


Lys 


Tyr 




1010 








1015 








1020 








Pro 


Leu Arg 


Asn 


Leu 


Gin 


Pro 


Ala 


Ser 


Glu 


Tyr 


Thr 


Val 


Ser 


Leu 


Val 


1025 








1030 








1035 








104C 


Ala 


He 


Lys 


Gly Asn 


Gin 


Glu 


Ser 


Pro 


Lys 


Ala 


Thr 


Gly Val 


Phe 


Thr 










1045 








1050 








1055 


Thr 


Leu 


Gin 


Pro 


Gly 


Ser 


Ser 


He 


Pro 


Pro 


Tyr 


Asn 


Thr 


Glu 


Val 


Thr 








1060 








1065 








1070 




Glu 


Thr 


Thr 


He 


Val 


He 


Thr 


Trp 


Thr 


Pro 


Ala 


Pro 


Arg 


He 


Gly Phe 






1075 








1080 








1085 






Lys 


Leu Gly Val Arg 


Pro 


Ser 


Gin Gly 


Gly Glu Ala 


Pro 


Arg 


Glu 


Val 




1090 








1095 








1100 








Thr 


Ser Asp 


Ser Gly 


Ser 


He 


Val 


Val 


Ser Gly Leu 


Thr 


Pro Gly Val 


1105 








1110 








1115 








112C 


Glu 


Tyr Val 


Tyr 


Thr 


He 


Gin 


Val 


Leu 


Arg Asp Gly Gin Glu Arg 


Asp 



1125 1130 1135 
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Ala Pro lie Val Asn Lys Val Val Thr Pro Leu Ser Pro Pro Thr Asn 

1140 1145 1150 

Leu His Leu Glu Ala Asn Pro Asp Thr Gly Val Leu Thr Val Ser Trp 

1155 1160 * 1165 

Glu Arg Ser Thr Thr Pro Asp lie Thr Gly Tyr Arg lie Thr Thr Thr 

1170 1175 1180 

Pro Thr Asn Gly Gin Gin Gly Asn Ser Leu Glu Glu Val Val His Ala 
1185 1190 1195 1200 

Asp Gin Ser Ser Cys Thr Phe Asp Asn Leu Ser Pro Gly Leu Glu Tyr 

1205 1210 1215 

Asn Val Ser Val Tyr Thr Val Lys Asp Asp Lys Glu Ser Val Pro He 

1220 " 1225 1230 

Ser Asp Thr He He Pro Ala Val Pro Pro Pro Thr Asp Leu Arg Phe 

1235 1240 1245 

Thr Asn He Gly Pro Asp Thr Met Arg Val Thr Trp Ala Pro Pro Pro 

1250 1255 1260 

Ser He Asp Leu Thr Asn Phe Leu Val Arg Tyr Ser Pro Val Lys Asn 
1265 1270 1275 1280 

Glu Glu Asp Val Ala Glu Leu Ser He Ser Pro Ser Asp Asn Ala Val 

1285 1290 1295 

Val Leu Thr Asn Leu Leu Pro Gly Thr Glu Tyr Val Val Ser Val Ser 

1300 1305 1310 

Ser Val Tyr Glu Gin His Glu Ser Thr Pro Leu Arg Gly Arg Gin Lys 

1315 1320 1325 

Thr Gly Leu Asp Ser Pro Thr Gly He Asp Phe Ser Asp He Thr Ala 

1330 1335 1340 

Asn Ser Phe Thr Val His Trp He Ala Pro Arg Ala Thr He Thr Gly 
1345 1350 1355 1360 

Tyr Arg He Arg His His Pro Glu His Phe Ser Gly Arg Pro Arg Glu 

1365 1370 1375 

Asp Arg Val Pro His Ser Arg Asn Ser He Thr Leu Thr Asn Leu Thr 

1380 1385 1390 

Pro Gly Thr Glu Tyr Val Val Ser He Val Ala Leu Asn Gly Arg Glu 

1395 1400 1405 

Glu Ser Pro Leu Leu He Gly Gin Gin Ser Thr Val Ser Asp Val Pro 

1410 1415 1420 

Arg Asp Leu Glu Val Val Ala Ala Thr Pro Thr Ser Leu Leu He Ser 
1425 1430 1435 1440 

Trp Asp Ala Pro Ala Val Thr Val Arg Tyr Tyr Arg He Thr Tyr Gly 

1445 1450 1455 

Glu Thr Gly Gly Asn Ser Pro Val Gin Glu Phe Thr Val Pro Gly Ser 

1460 1465 1470 

Lys Ser Thr Ala Thr He Ser Gly Leu Lys Pro Gly Val Asp Tyr Thr 

1475 1480 1485 

He Thr Val Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser 

1490 1495 1500 

Lys Pro He Ser He Asn Tyr Arg Thr Glu He Asp Lys Pro Ser Gin 
1505 1510 1515 1520 

Met Gin Val Thr Asp Val Gin Asp Asn Ser He Ser Val Lys Trp Leu 

1525 1530 1535 

Pro Ser Ser Ser Pro Val Thr Gly Tyr Arg Val Thr Thr Thr Pro Lys 

1540 1545 1550 

Asn Gly Pro Gly Pro Thr Lys Thr Lys Thr Ala Gly Pro Asp Gin Thr 
1555 1560 1565 
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Glu Met Thr lie Glu Gly Leu Gin Pro Thr Val Glu Tyr Val Val Ser 

1570 1575 1580 

Val Tyr Ala Gin Asn Pro Ser Gly Glu Ser Gin Pro Leu Val Gin Thr 
1585 1590 1595 1600 

Ala Val Thr Asn He Asp Arg Pro Lys Gly Leu Ala Phe Thr Asp Val 

1605 1610 1615 

Asp Val Asp Ser He Lys He Ala Trp Glu Ser Pro Gin Gly Gin Val 

1620 1625 1630 

Ser Arg Tyr Arg Val Thr Tyr Ser Ser Pro Glu Asp Gly He His Glu 

1635 1640 1645 

Leu Phe Pro Ala Pro Asp Gly Glu Glu Asp Thr Ala Glu Leu Gin Gly 

1650 1655 1660 

Leu Arg Pro Gly Ser Glu Tyr Thr Val Ser Val Val Ala Leu His Asp 
1665 1670 1675 1680 

Asp Met Glu Ser Gin Pro Leu lie Gly Thr Gin Ser Thr Ala He Pro 

1685 1690 1695 

Ala Pro Thr Asp Leu Lys Phe Thr Gin Val Thr Pro Thr Ser Leu Ser 

1700 1705 1710 

Ala Gin Trp Thr Pro Pro Asn Val Gin Leu Thr Gly Tyr Arg Val Arg 

1715 1720 1725 

Val Thr Pro Lys Glu Lys Thr Gly Pro Met Lys Glu He Asn Leu Ala 

1730 1735 1740 

Pro Asp Ser Ser Ser Val Val Val Ser Gly Leu Met Val Ala Thr Lys 
1745 1750 1755 1760 

Tyr Glu Val Ser Val Tyr Ala Leu Lys Asp Thr Leu Thr Ser Arg Pro 

1765 1770 1775 

Ala Gin Gly Val Val Thr Thr Leu Glu Asn Val Ser Pro Pro Arg Arg 

1780 1785 1790 

Ala Arg Val Thr Asp Ala Thr Glu Thr Thr He Thr He Ser Trp Arg 

1795 1800 1805 

Thr Lys Thr Glu Thr He Thr Gly Phe Gin Val Asp Ala Val Pro Ala 

1810 1815 1820 

Asn Gly Gin Thr Pro He Gin Arg Thr He Lys Pro Asp Val Arg Ser 
1825 1830 1835 1840 

Tyr Thr He Thr Gly Leu Gin Pro Gly Thr Asp Tyr Lys He Tyr Leu 

1845 1850 1855 

Tyr Thr Leu Asn Asp Asn Ala Arg Ser Ser Pro Val Val He Asp Ala 

1860 1865 1870 

Ser Thr Ala He Asp Ala Pro Ser Asn Leu Arg Phe Leu Ala Thr Thr 

1875 1880 1885 

Pro Asn Ser Leu Leu Val Ser Trp Gin Pro Pro Arg Ala Arg He Thr 

1890 1895 1900 

Gly Tyr lie He Lys Tyr Glu Lys Pro Gly Ser Pro Pro Arg Glu Val 
1905 1910 1915 1920 

Val Pro Arg Pro Arg Pro Gly Val Thr Glu Ala Thr He Thr Gly Leu 

1925 1930 1935 

Glu Pro Gly Thr Glu Tyr Thr He Tyr Val He Ala Leu Lys Asn Asn 

1940 1945 1950 

Gin Lys Ser Glu Pro Leu lie Gly Arg Lys Lys Thr Asp Glu Leu Pro 

1955 1960 1965 

Gin Leu Val Thr Leu Pro His Pro Asn Leu His Gly Pro Glu He Leu 

1970 1975 1980 

Asp Val Pro Ser Thr Val Gin Lys Thr Pro Phe Val Thr His Pro Gly 
1985 1990 1995 2000 



-240- 



BNSOOCID: <WO 02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



Tyr Asp Thr Gly Asn Gly lie Gin Leu Pro Gly Thr Ser Gly Gin Gin 

2005 2010 2015 

Pro Ser Val Gly Gin Gin Met lie Phe Glu Glu His Gly Phe Arg Arg 

2020 2025 2030 

Thr Thr Pro Pro Thr Thr Ala Thr Pro lie Arg His Arg Pro Arg Pro 

2035 2040 2045 

Tyr Pro Pro Asn Val Gly Gin Glu Ala Leu Ser Gin Thr Thr lie Ser 

2050 " 2055 2060 

Trp Ala Pro Phe Gin Asp Thr Ser Glu Tyr lie He Ser Cys His Pro 
2065 2070 2075 208( 

Val Gly Thr Asp Glu Glu Pro Leu Gin Phe Arg Val Pro Gly Thr Ser 

2085 2090 2095 

Thr Ser Ala Thr Leu Thr Gly Leu Thr Arg Gly Ala Thr Tyr Asn He 

2100 2105 2110 

He Val Glu Ala Leu Lys Asp Gin Gin Arg His Lys Val Arg Glu Glu 

2115 2120 2125 

Val Val Thr Val Gly Asn Ser Val Asn Glu Gly Leu Asn Gin Pro Thr 

2130 2135 2140 

Asp Asp Ser Cys Phe Asp Pro Tyr Thr Val Ser His Tyr Ala Val Gly 
2145 2150 2155 216< 

Asp Glu Trp Glu Arg Met Ser Glu Ser Gly Phe Lys Leu Leu Cys Gin 

2165 2170 2175 

Cys Leu Gly Phe Gly Ser Gly His Phe Arg Cys Asp Ser Ser Arg Trp 

2180 2185 2190 

Cys His Asp Asn Gly Val Asn Tyr Lys He Gly Glu Lys Trp Asp Arg 

2195 2200 2205 

Gin Gly Glu Asn Gly Gin Met Met Ser Cys Thr Cys Leu Gly Asn Gly 

2210 2215 2220 

Lys Gly Glu Phe Lys Cys Asp Pro His Glu Ala Thr Cys Tyr Asp Asp 
2225 2230 2235 2241 

Gly Lys Thr Tyr His Val Gly Glu Gin Trp Gin Lys Glu Tyr Leu Gly 

2245 2250 2255 

Ala He Cys Ser Cys Thr Cys Phe Gly Gly Gin Arg Gly Trp Arg Cys 

2260 2265 2270 

Asp Asn Cys Arg Arg Pro Gly Gly Glu Pro Ser Pro Glu Gly Thr Thr 

2275 2280 2285 

Gly Gin Ser Tyr Asn Gin Tyr Ser Gin Arg Tyr His Gin Arg Thr Asn 

2290 2295 2300 

Thr Asn Val Asn Cys Pro He Glu Cys Phe Met Pro Leu Asp Val Gin 
2305 2310 2315 232i 

Ala Asp Arg Glu Asp Ser Arg Glu 



<210> 678 
<211> 188 
<212> PRT 
<213> Homo sapiens 

<400> 678 

His Gin Thr His Lys Glu Gly Gly Ser Thr His Ala Ser Ala Asp Ala 

15 10 15 

Trp Glu He He Glu Leu Glu Thr Glu He Glu Lys Phe Lys Ala Glu 

20 25 30 

Asn Ala Ser Leu Ala Lys Leu Arg He Glu Arg Glu Ser Ala Leu Glu 



2325 
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35 40 45 



Lys 


Leu 
50 


Arg 


Lys 


Glu 


He 


Ala 
55 


Asp 


Phe 


Glu 


Gin 


Gin 
60 


Lys 


Ala 


Lys 


Glu 


Leu 


Ala 


Arg 


He 


Glu 


Glu 


Phe 


Lys 


Lys 


Glu 


Glu 


Met 


Arg 


Lys 


Leu 


Gin 


65 










70 










75 










80 


Lys 


Glu 


Arg 


Lys 


Val 
85 


Phe 


Glu 


Lys 


Tyr 


Thr 
90 


Thr 


Ala 


Ala 


Arg 


Thr 
95 


Phe 


Pro 


Asp 


Lys 


Lys 
100 


Glu 


Arg 


Glu 


Glu 


He 
105 


Gin 


Thr 


Leu 


Lys 


Gin 
110 


Gin 


He 


Ala 


Asp 


Leu 
115 


Arg 


Glu 


Asp 


Leu 


Lys 
120 


Arg 


Lys 


Glu 


Thr 


Lys 
125 


Trp 


Ser 


Ser 


Thr 


His 
130 


Ser 


Arg 


Leu 


Arg 


Ser 
135 


Gin 


He 


Gin 


Met 


Leu 
14 0 


Val 


Arg 


Glu 


Asn 


Thr 


Asp 


Leu 


Arg 


Glu 


Glu 


He 


Lys 


Val 


Met 


Glu 


Arg 


Phe 


Arg 


Leu Asp 


145 










150 










155 










160 


Ala 


Trp 


Lys 


Arg 


Ala 
165 


Glu 


Ala 


He 


Glu 


Ser 
170 


Ser 


Leu 


Glu 


Val 


Glu 
175 


Lys 


Lys 


Asp 


Lys 


Leu 
180 


Ala 


Asn 


Thr 


Ser 


Val 
185 


Arg 


Phe 


Gin 
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<400> 679 



Met 


Glu 


Pro 


Gly Asn 


Tyr 


Ala Thr Leu 


Asp 


Gly Ala 


Lys 


Asp 


He 


Glu 


1 






5 






10 










15 




Gly 


Leu 


Leu 


Gly Ala 


Gly Gly Gly Arg 


Asn 


Leu 


Val 


Ala 


His 


Ser 


Pro 








20 




25 










30 






Leu 


Thr 


Ser 


His Pro 


Ala 


Ala Pro Thr 


Leu 


Met 


Pro 


Ala 


Val 


Asn Tyr 






35 






40 








45 








Ala 


Pro 


Leu 


Asp Leu 


Pro Gly Ser Ala 


Glu 


Pro 


Pro 


Lys 


Gin 


Cys 


His 




50 








55 






60 










Pro 


Cys 


Pro 


Gly Val 


Pro Gin Gly Thr 


Ser 


Pro 


Ala 


Pro 


Val 


Pro 


Tyr 


65 








70 






75 










80 


Gly 


Tyr 


Phe 


Gly Gly 


Gly Tyr Tyr Ser 


Cys 


Arg 


Val 


Ser 


Arg 


Ser 


Ser 








85 






90 










95 




Leu 


Lys 


Pro 


Cys Ala 
100 


Gin 


Ala Ala Thr 
105 


Leu 


Ala 


Ala 


Tyr 


Pro 
110 


Ala 


Glu 


Thr 


Pro 


Thr Ala Gly 


Glu 


Glu Tyr Pro 


Ser 


Arg 


Pro 


Thr 


Glu 


Phe 


Ala 






115 






120 








125 








Phe 


Tyr 
130 


Pro 


Gly Tyr 


Pro 


Gly Thr Tyr 
135 


His 


Ala 


Met 
140 


Ala 


Ser 


Tyr 


Leu 


Asp 


Val 


Ser 


Val Val 


Gin 


Thr Leu Gly 


Ala 


Pro Gly Glu 


Pro 


Arg 


His 


145 








150 






155 










160 


Asp 


Ser 


Leu 


Leu Pro 
165 


Val 


Asp Ser Tyr 


Gin 
170 


Ser 


Trp 


Ala 


Leu 


Ala 
175 


Gly 


Gly 


Trp 


Asn 


Ser Gin 


Met 


Cys Cys Gin 


Gly Glu 


Gin 


Asn 


Pro 


Pro 


Gly 








180 




185 










190 






Pro 


Phe 


Trp 


Lys Ala 


Ala 


Phe Ala Asp 


Ser 


Ser Gly Gin His 


Pro 


Pro 






195 






200 








205 








Asp 


Ala 
210 


Cys 


Ala Phe 


Arg 


Arg Gly Arg 
215 


Lys 


Lys 


Arg 
220 


He 


Pro 


Tyr 


Ser 
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Lys 


Gly Gin 


Leu Arg Glu Leu Glu Arg Glu Tyr Ala Ala 


Asn 


Lys 


Phe 


225 


230 




235 








^ *± \J 


lie 


Thr Lys 


Asp Lys Arg Arg 


Lys 


lie Ser Ala Ala 


Thr 


Ser 


Leu 


Ser 






245 




250 






255 




Glu 


Arg Gin 


lie Thr lie Trp 


Phe 


Gin Asn Arg Arg 


Val 


Lys 


Glu 


Lys 




260 




265 




270 






Lys 


Val Leu 


Ala Lys Val Lys 


Asn 


Ser Ala Thr Pro 












275 




280 












<210> 680 
















<211> 676 
















<212> PRT 
















<213> Homo sapiens 














<400> 680 
















Met 


Asp Lys 


Tyr Asp Asp Leu Gly Leu Glu Ala Ser 


Lys 


Phe 


He 


Glu 


1 


5 




10 






15 




Asp 


Leu Asn 


Met Tyr Glu Ala Ser Lys Asp Gly Leu 


Phe 


Arg 


Val 


Asp 






20 




25 




30 






Lys 


Gly Ala 


Gly Asn Asn Pro 


Glu 


Phe Glu Glu Thr 


Arg 


Arg 


Val 


Phe 




35 




40 




45 








Ala 


Thr Lys 


Met Ala Lys lie 


His 


Leu Gin Gin Gin 


Gin 


Gin 


Gin 


Leu 




50 


55 




60 










Leu 


Gin Glu 


Glu Thr Leu Pro Arg Gly Ser Arg Gly Pro 


Val 


Asn 


Gly 


65 




70 




75 








80 


Gly Gly Arg 


Leu Gly Pro Gin 


Ala 


Arg Trp Glu Val 


Val 


Gly 


Ser 


Lys 






85 




90 






95 




Leu 


Thr Val 


Asp Gly Ala Ala 


Lys 


Pro Pro Leu Ala 


Ala 


Ser 


Thr 


Gly 






100 




105 




110 






Ala 


Pro Gly 


Ala Val Thr Thr 


Leu Ala Ala Gly Gin 


Pro 


Pro 


Tyr 


Pro 




115 




12 0 




125 








Pro 


Gin Glu 


Gin Arg Ser Arg 


Pro 


Tyr Leu His Gly 


Thr 


Arg 


His 


Gly 




130 


135 




140 










Ser 


Gin Asp 


Cys Gly Ser Arg 


Glu 


Ser Leu Ala Thr 


Ser 


Glu 


Met 


Ser 


145 


150 




155 








160 


Ala 


Phe His 


Gin Pro Gly Pro 


Cys 


Glu Asp Pro Ser 


Cys 


Leu 


Thr 


His 






165 




170 






175 




Gly Asp Tyr 


Tyr Asp Asn Leu 


Ser 


Leu Ala Ser Pro 


Lys 


Trp 


Gly 


Asp 






180 




185 




190 






Lys 


Pro Gly 


Val Ser Pro Ser 


lie 


Gly Leu Ser Val 


Gly 


Ser Gly 


Trp 


195 




200 




205 








Pro 


Ser Ser 


Pro Gly Ser Asp 


Pro 


Pro Leu Pro Lys 


Pro 


Cys 


Gly 


Asp 




210 


215 




220 










His 


Pro Leu 


Asn His Arg Gin 


Leu 


Ser Leu Ser Ser 


Ser 


Arg 


Ser 


Ser 


225 




230 




235 








240 


Glu 


Gly Ser 


Leu Gly Gly Gin Asn 


Ser Gly lie Gly Gly Arg 


Ser 


Ser 




245 




250 






255 




Glu 


Lys Pro 


Thr Gly Leu Trp 


Ser 


Thr Ala Ser Ser 


Gin 


Arg 


Val 


Ser 




260 




265 




270 






Pro 


Gly Leu 


Pro Ser Pro Asn 


Leu 


Glu Asn Gly Ala 


Pro 


Ala 


Val 


Gly 




275 




280 




285 








Pro 


Val Gin 


Pro Arg Thr Pro 


Ser 


Val Ser Ala Pro 


Leu 


Ala 


Leu 


Ser 




290 


295 




300 










Cys 


Pro Arg 


Gin Gly Gly Leu 


Pro 


Arg Ser Asn Ser 


Gly 


Leu 


Gly 


Gly 
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V 
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305 310 



Glu 


Val 


Ser 


Gly 


Val 


Met 


Ser Lys 


Pro 










325 








Phe 


Gin Asp 


Gly 


Pro 


Lys 


Ser Tyr 


Leu 








340 








345 


Ser 


Pro 


Ala 


Gly 


Leu 


Asp 


Gly Ser 


Gin 






355 








360 




Gly 


Pro 


Lys 


Pro 


Gly Cys 


Thr Asp Leu 




370 










375 




Pro 


Thr 


Ser 


Leu 


Val 


His 


Pro Val 


Met 


385 










390 






Cys 


Lys 


Glu 


Gly 


Pro 


Leu 


Gly Trp 


Ser 










405 








Val 


Leu 


Leu 


Asp 


Ser 


Pro 


Ser Ser 


Pro 








420 








425 


Pro 


Leu 


Val 


Pro 


Gly 


Pro 


Glu Leu 


Arg 






435 








440 




Leu 


Glu 


Ala 


Leu 


Thr 


Gin 


Arg Leu 


Glu 




450 










455 




Lys 


Ala Asp 


Tyr 


Phe 


Gly 


Ala Cys 


Val 


465 










470 






Gly 


Ala Gly 


Gin 


Ala 


Cys 


Gin Ala 


Met 










485 








Cys 


Phe 


Thr 


Cys 


Ala 


Ala 


Cys Ser 


Arg 








500 








505 


Tyr 


Phe 


Val 


Asn 


Gly Lys 


Val Phe 


Cys 






515 








520 




Gly 


Phe 


Gin 


Gin 


Ser 


Ala 


Asp Arg 


Cys 




530 










535 




Met 


As P 


Met 


lie 


Leu 


Gin 


Ala Leu Gly 


545 










550 






Phe 


Arg 


Cys 


Val 


lie 


Cys 


Asn Glu 


Cys 










565 








Val 


Asp 


Ser 


Glu 


Asn 


Lys 


lie Tyr 


Cys 








580 








585 


Leu 


Ala 


Pro 


Lys 


Cys 


Ala 


Ala Cys 


Gly 






595 








600 




Gly 


Ser 


Asp 


Glu 


Thr 


lie 


Arg Val 


Val 




610 










615 




Val 


Glu Cys 


Tyr 


His 


Cys 


Glu Asp 


Cys 


625 










630 






Asp 


Gly His 


Arg 


Cys 


Tyr 


Pro Leu 


Glu 










645 








Cys 


His 


Val 


Lys 


Arg 


Leu 


Glu Lys 


Arg 








660 








665 


Gin 


His 


His 


Phe 














675 
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315 320 



Asn 


Val 


Asp 


Pro 


Gin 


Pro 


Trp 


330 










335 




Ser 


Ser 


Ser 


Ala 


Pro 


Ser 


Ser 










350 






Gin 


Gly 


Ala 


Val 


Pro Gly 


Leu 








365 








Gly 


Thr 


Gly 


Pro 


Lys 


Leu 


Ser 






380 










Ser 


Thr 


Leu 


Pro 


Glu 


Leu 


Ser 




395 










400 


Ser 


Asp 


Gly Ser 


Leu Gly 


Ser 


410 










415 




Arg 


Val 


Arg 


Leu 


Pro 


Cys 


Gin 










430 






Pro 


Ser 


Ala 


Ala 


Glu 


Leu 


Lys 








445 








Arg 


Glu 


Met 


Asp 


Ala 


His 


Pro 






460 










Lys 


Cys 


Ser 


Lys 


Gly Val 


Phe 




475 










480 


Gly 


Asn 


Leu 


Tyr 


His 


Asp 


Thr 


490 










495 




Lys 


Leu 


Arg Gly 


Lys 


Ala 


Phe 










510 






Glu 


Glu 


Asp 


Phe 


Leu 


Tyr 


Ser 








525 








Phe 


Leu 


Cys 


Gly His 


Leu 


He 






540 










Lys 


Ser 


Tyr 


His 


Pro 


Gly 


Cys 




555 










560 


Leu 


Asp 


Gly Val 


Pro 


Phe 


Thr 


570 










575 




Val 


Arg 


Asp 


Tyr 


His 


Lys 


Val 










590 






Leu 


Pro 


He 


Leu 


Pro 


Pro 


Glu 








605 








Ser 


Met 


Asp 


Arg 


Asp 


Tyr 


His 






620 










Gly 


Leu 


Glu 


Leu 


Asn Asp 


Glu 




635 










640 


Asp 


His 


Leu 


Phe 


Cys 


His 


Ser 


650 










655 




Pro 


Ser 


Ser 


Thr 


Ala 


Leu 


His 



670 
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Ser 


Thr 


Gly Ser 


1 






Ala 


Ala 


Arg Ala 






20 


Arg 


Glu 


Thr Arg 






35 


Thr 


Met 


Thr Gly 




50 




Gin 


Arcr 


Cys Gin 


65 






Asn 


Gly 


Glu Leu 


Arq 


Pro 


Phe Pro 






100 


Cys 


Glu 


His Asp 






115 


Gly 


Glu 


Phe lie 




130 




His 


Pro 


Glv Cvs 


145 






Leu 


Glv 


Phe Val 


Asn 


Arcr 


Glu Lys 






180 


His 


Leu 


Val lie 






195 


His 


Pro 


Asp His 




210 




Glu 


Ala 


Arg Glu 


225 






Lys 


Met 


Gly Val 


Arg 


Val 


Val Asn 






260 


Cys 


Ala 


Lys Cys 






275 


Lys 


Gly 


Leu Ala 



290 



Glu 


Phe 


Pro 


Leu 


5 








Gly Gly Arg 


Ala 


Gly Asn 


Pro 


Gly 








40 


Ser 


Asn 


Met 


Ser 






55 




Ala 


Arg 


Phe 


Ser 




70 






Tyr 


His 


Glu 


His 


85 








Glu 


Gly 


Leu 


Phe 


Phe 


Gin 


Met 


Leu 








120 


He 


Gly 


Arg 


Val 






135 




Phe 


Arg 


Cys 


Glu 




150 






Lys 


Asn 


Ala 


Glv 


165 








Ala 


Lys 


Gly 


Leu 


Asp 


Glu 


Gin 


Pro 








200 


Phe 


Asn 


Cys 


Thr 






215 




Leu 


Lys 


Gly 


Glu 




230 






Pro 


He 


Cys 


Glv 


245 








Ala Leu Gly 


Lys 


Glu 


Lys 


Pro 


Phe 








280 


Tyr 


Cys 


Glu 


Leu 






295 





Cys Thr Lys Ala 
10 

Leu Gly Trp Arg 
25 

Asn Pro Gly Leu 

Asp Ala Leu Ala 
60 

Pro Ala Glu Arg 
75 

Cys Phe Val Cys 
90 

Tyr Glu Phe Glu 
105 

Phe Ala Pro Cys 

He Lys Ala Met 
140 

Leu Cys Asp Val 
155 

Arg His Leu Cys 
170 

Gly Lys Tyr He 
185 

Leu Met Phe Arg 

His Cys Gly Lys 
220 

Leu Tyr Cys Leu 
235 

Ala Cys Arg Arg 
250 

Gin Trp His Val 
265 

Leu Gly His Arg 



Ser 


Pro 


Cys 


Ser 






15 




Leu 


Gin 


Gin 


Gin 




30 






Gly 


Val 


Ala 


Ala 


45 








Asn 


Ala 


Val 


Cys 


He 


Val 


Asn 


Ser 








80 


Ala 


Gin 


Cys 


Phe 






95 




Gly 


Arcr 


Lys 


Tyr 




110 






Cys 


Gly 


Ser 


Cys 


125 








Asn 


Asn 


Asn 


Trp 


Glu 


Leu 


Ala 


Asp 








160 


Arg 


Pro 


Cys 


His 






175 




Cvs 


Gin Arg 


Cvs 




190 






Ser 


Asp 


Ala 


Tyr 


205 








Glu 


Leu 


Thr 


Ala 


Pro 


Cys 


His 


Asp 








240 


Pro 


He 


Glu 


Gly 






255 




Glu 


His 


Phe 


Val 




270 






His 


Tyr 


Glu 


Lys 


285 
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<400> 682 

Met Gly He Gly Leu Ser Ala Gin 

1 5 
Gly Trp Asp Lys His Ser Tyr Gly 
20 

Phe Cys Ser Ser Gly Thr Gly Gin 

35 " 40 

Gly Asp Val He Gly Cys Cys Val 

50 55 
Tyr Thr Lys Asn Gly His Ser Leu 



Gly Val Asn Met Asn Arg Leu Pro 

10 15 
Tyr His Gly Asp Asp Gly His Ser 
25 3 0 

Pro Tyr Gly Pro Thr Phe Thr Thr 
45 

Asn Leu He Asn Asn Thr Cys Phe 
60 

Gly He Ala Phe Thr Asp Leu Pro 
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65 










70 






Pro 


Asn 


Leu 


Tyr 


Pro 


Thr Val 


Gly 


Leu 










85 








Asp 


Ala 


Asn 


Phe 


Gly Gin His 


Pro 


Phe 








100 








105 


Met 


Arg 


Glu 


Trp 


Arg 


Thr Lys 


He 


Gin 






115 








120 




lie 


Gly 


Asp 


Arg 


Glu 


Gly Glu Trp 


Gin 




130 








135 






Ser 


Ser 


Tyr 


Leu 


Val 


His His 


Gly 


Tyr 


145 










150 






Ala 


Arg 


Ser 


Thr Asp 


Gin Thr 


Val 


Leu 










165 








Asn 


Arg 


Gin 


Arg 


He 


Gin Lys 


Leu 


Val 








180 








185 


Ala 


lie 


Glu 


Thr 


Thr 


Gin Gin 


Leu 


Tyr 






195 








200 




Pro 


Asn 


Leu 


Leu 


Phe 


Thr Leu 


Lys 


Val 




210 








215 






Asn 


Gly 


Thr 


Asp 


Ser 


Glu Val 


Arg 


Cys 


225 










230 






Ser 


Gin 


Asp 


Ser 


Tyr 


Pro Val 


Ser 


Pro 










245 








Met 


Ser 


Pro 


Ser 


His 


Gly Met Asn 


He 








260 








265 


Gly 


Ser 


Thr 


Ala 


His 


Phe Ser 


Gly 


Phe 






275 








280 




lie 


Ser 


Asn 


Lys 


Ala 


His Gin 


Ser 


Tyr 




290 








295 






Ser 


Ser 


Asn 


Leu 


Asn 


Val Pro 


Glu 


Leu 


305 










310 






Ser 


Gin 


Gin 


Val 


Asn 


Asn Phe 


Thr 


Ser 










325 








Asp 


His 


Tyr 


Ser Asn Gly Val 


Gly Glu 








340 








345 


Asn 


Gly 


Ser 


Ser 


Lys 


His Asp 


His 


Glu 






355 








360 




Met 


Glu 


Val 


Asp 


Ser 


Ser Gin 


Leu 


Arg 




370 








375 






Gin 


Ala 


Ala 


He 


Glu 


Arg Met 


He 


His 


385 










390 






Met 


Ser 


Glu 


Gin 


Leu 


Arg Arg 


Asp 


Cys 










405 








Lys 


Met 


Leu 


Lys 


Asp 


Ala Phe 


Ser 


Leu 








420 








425 


Asn 


Ser 


Pro 


Val 


Gly Asn Gin Leu Asp 






435 








440 




Cys 


Ser 


Ala 


Leu 


Asn 


Ser Ala 


He 


Leu 




450 








455 






Gin 


Pro 


Pro 


Leu 


Ala 


Leu Ala 


Met 


Gly 


465 










470 






Leu 


Met 


Ala Arg Ser Gly lie 


Gly Ser 










485 








Asp 


Tyr 


Leu 


His 
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75 










80 


Gin 


Thr 


Pro Gly Glu 


Val 


Val 


90 










95 




Val 


Phe 


Asp 


He 


Glu 


Asp 


Tyr 










110 






Ala 


Gin 


He 


Asp Arg 


Phe 


Pro 








125 








Thr 


Met 


He 


Gin 


Lys 


Met 


Val 






140 










Cys 


Ala 


Thr 


Ala 


Glu 


Ala 


Phe 




155 










160 


Glu 


Glu 


Leu 


Ala 


Ser 


He 


Lys 


170 










175 




Leu 


Ala 


Gly Arg Met 


Gly 


Glu 










190 






Pro 


Ser 


Leu 


Leu 


Glu 


Arg 


Asn 








205 








Arg 


Gin 


Phe 


He 


Glu 


Met 


Val 






220 










Leu Gly Gly Arg 


Ser 


Pro 


Lys 




235 










240 


Arg 


Pro 


Phe 


Ser 


Ser 


Pro 


Ser 


250 










255 




His 


Asn 


Leu 


Ala 


Ser 


Gly 


Lys 










270 






Glu 


Ser 


Cys 


Ser 


Asn 


Gly 


Val 








285 








Cys 


His 


Ser 


Asn 


Lys 


His 


Gin 






300 










Asn 


Ser 


He 


Asn 


Met 


Ser 


Arg 




315 










320 


Asn 


Asp 


Val 


Asp 


Met 


Glu 


Thr 


330 










335 




Thr 


Ser 


Ser Asn Gly 


Phe 


Leu 










350 






Met 


Glu 


Asp 


Cys 


Asp 


Thr 


Glu 








365 








Arg 


Gin 


Leu 


Cys 


Gly 


Gly 


Ser 






380 










Phe 


Gly Arg Glu Leu 


Gin 


Ala 




395 










400 


Gly 


Lys 


Asn 


Thr 


Ala 


Asn 


Lys 


410 










415 




Leu 


Ala 


Tyr 


Ser 


Asp 


Pro 


Trp 










430 






Pro 


He 


Gin 


Arg 


Glu 


Pro 


Val 








445 








Glu 


Thr 


His 


Asn 


Leu 


Pro 


Lys 






460 










Gin 


Ala 


Thr 


Gin 


Cys 


Leu 


Gly 




475 










480 


Cys 


Ala 


Phe 


Ala 


Thr 


val 


Glu 


490 










495 
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500 

<210> 683 
<211> 387 
<212> PRT 

<213> Homo sapiens 
<400> 683 

Met Ala Thr Ser Gly Val Leu Pro Gly Gly Gly Phe Val Ala Ser Ala 

15 10 15 

Ala Ala Val Ala Gly Pro Glu Met Gin Thr Gly Arg Asn Asn Phe Val 

20 25 30 

lie Arg Arg Asn Pro Ala Asp Pro Gin Arg lie Pro Ser Asn Pro Ser 

35 40 45 

His Arg lie Gin Cys Ala Ala Gly Tyr Glu Gin Ser Glu His Asn Val 

50 55 60 

Cys Gin Asp He Asp Glu Cys Thr Ala Gly Thr His Asn Cys Arg Ala 
65 *' 70 75 80 

Asp Gin Val Cys He Asn Leu Arg Gly Ser Phe Ala Cys Gin Cys Pro 

85 90 95 

Pro Gly Tyr Gin Lys Arg Gly Glu Gin Cys Val Asp He Asp Glu Cys 

100 105 HO 

Thr He Pro Pro Tyr Cys His Gin Arg Cys Val Asn Thr Pro Gly Ser 

115 120 125 

Phe Tyr Cys Gin Cys Ser Pro Gly Phe Gin Leu Ala Ala Asn Asn Tyr 

130 135 140 

Thr Cys Val Asp He Asn Glu Cys Asp Ala Ser Asn Gin Cys Ala Gin 
145 150 155 160 

Gin Cys Tyr Asn He Leu Gly Ser Phe He Cys Gin Cys Asn Gin Gly 

165 170 175 

Tyr Glu Leu Ser Ser Asp Arg Leu Asn Cys Glu Asp He Asp Glu Cys 

180 185 190 

Arg Thr Ser Ser Tyr Leu Cys Gin Tyr Gin Cys Val Asn Glu Pro Gly 

195 200 205 

Lys Phe Ser Cys Met Cys Pro Gin Gly Tyr Gin Val Val Arg Ser Arg 

210 215 220 

Thr Cys Gin Asp He Asn Glu Cys Glu Thr Thr Asn Glu Cys Arg Glu 
225 ~ 230 235 240 

Asp Glu Met Cys Trp Asn Tyr His Gly Gly Phe Arg Cys Tyr Pro Arg 

245 250 255 

Asn Pro Cys Gin Asp Pro Tyr He Leu Thr Pro Glu Asn Arg Cys Val 

260 265 270 

Cys Pro Val Ser Asn Ala Met Cys Arg Glu Leu Pro Gin Ser He Val 

275 280 285 

Tyr Lys Tyr Met Ser He Arg Ser Asp Arg Ser Val Pro Ser Asp He 

290 295 300 

Phe Gin He Gin Ala Thr Thr He Tyr Ala Asn Thr He Asn Thr Phe 
305 310 315 320 

Arg He Lys Ser Gly Asn Glu Asn Gly Glu Phe Tyr Leu Arg Gin Thr 

325 330 335 

Ser Pro Val Ser Ala Met Leu Val Leu Val Lys Ser Leu Ser Gly Pro 

340 345 350 

Arg Glu His He Val Asp Leu Glu Met Leu Thr Val Ser Ser He Gly 
355 360 365 
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Thr Phe Arg Thr Ser Ser Val Leu Arg 

370 375 
Phe Ser Phe 
385 

<210> 684 
<211> 531 
<212> PRT 
<213> Homo sapiens 



<400> 684 



Met 


Ser 


Lys Pro His 


Ser 


Glu 


Ala 


Gly 


1 




5 










Gin 


Leu 


His Ala Ala 


Met 


Ala 


Asp 


Thr 






20 








25 


Leu 


Asp 


He Asp Ser 


Pro 


Pro 


He 


Thr 






35 






40 




Cys 


Thr 


He Gly Pro Ala 


Ser 


Arg 


Ser 




50 






55 






lie 


Lys 


Ser Gly Met 


Asn 


Val 


Ala 


Arg 


65 






70 








His 


Glu 


Tyr His Ala 


Glu 


Thr 


He 


Lys 






85 










Ser 


Phe 


Ala Ser Asp 


Pro 


Tyr 


Leu 


Tyr 






100 








105 


Asp 


Thr 


Lys Gly Pro Glu 


He 


Arg 


Thr 






115 






120 




Thr 


Ala 


Glu Leu Glu 


Leu 


Lys 


Lys 


Gly 




130 






135 






Asp 


Asn 


Ala Tyr Met 


Glu 


Lys 


Cys 


As P 


145 






150 








Tyr 


Lys 


Asn He Cys 


Lys 


Val 


Val 


Glu 






165 










Asp 


Asp 


Gly Leu He 


Ser 


Leu 


Gin 


Val 






180 








185 


Leu 


Val 


Thr Glu Val 


Glu 


Asn 


Gly Gly 






195 






200 




Val 


Asn 


Leu Pro Gly Ala 


Ala 


Val 


Asp 




210 






215 






Asp 


He 


Gin Asp Leu 


Lys 


Phe 


Gly Val 


225 






230 








Phe 


Ala 


Ser Phe He 


Arg 


Lys 


Ala 


Ser 






245 










Val 


Leu 


Gly Glu Lys 


Gly 


Lys 


Asn 


He 






260 








265 


Asn 


His 


Glu Gly Val Arg Arg 


Phe 


Asp 






275 






280 




Gly 


He 


Met Val Ala Arg 


Gly 


Asp 


Leu 




290 






295 






Lys 


Val 


Phe Leu Ala 


Gin 


Lys 


Met 


Met 


305 






310 








Gly 


Lys 


Pro Val He 


Cys 


Ala 


Thr 


Gin 






325 










Lys 


Pro 


Arg Pro Thr 


Arg 


Ala 


Glu Gly 
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Leu 


Thr 


He 


He 


Val 


Gly 


Pro 






380 










Thr 


Ala 


Phe 


He 


Gin 


Thr 


Gin 


10 










15 




Phe 


Leu 


Glu 


His 


Met 


Cys 


Arg 










30 






Ala 


Arg 


Asn 


Thr 


Gly 


He 


He 








45 








Val 


Glu 


Thr 


Leu 


Lys 


Glu 


Met 






60 










Leu 


Asn 


Phe 


Ser 


His 


Gly 


Thr 




75 










80 


Asn 


Val 


Arg 


Thr 


Ala 


Thr 


Glu 


90 










95 




Arg 


Pro 


Val 


Ala 


Val 


Ala 


Leu 










110 






Gly 


Leu 


He 


Lys 


Gly 


Ser 


Gly 








125 








Ala 


Thr 


Leu 


Lys 


He 


Thr 


Leu 






140 










Glu 


Asn 


He 


Leu 


Trp 


Leu 


Asp 




155 










160 


Val 


Gly 


Ser 


Lys 


He 


Tyr 


Val 


170 










175 




Lys 


Gin 


Lys 


Gly 


Ala 


Asp 


Phe 










190 






Ser 


Leu 


Gly 


Ser 


Lys 


Lys 


Gly 








205 








Leu 


Pro 


Ala 


Val 


Ser 


Glu 


Lys 






220 










Glu 


Gin 


Asp 


Val 


Asp 


Met 


Val 




235 










240 


Asp 


Val 


His 


Glu 


Val 


Arg 


Lys 


250 










255 




Lys 


He 


He 


Ser 


Lys 


He 


Glu 










270 






Glu 


He 


Leu 


Glu 


Ala 


Ser 


Asp 








285 








Gly 


He 


Glu 


He 


Pro 


Ala 


Glu 






300 










He 


Gly 


Arg 


Cys 


Asn 


Arg 


Ala 




315 










320 


Met 


Leu 


Glu 


Ser 


Met 


He 


Lys 


330 










335 




Ser 


Asp 


Val 


Ala 


Asn 


Ala 


Val 
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340 










345 






350 




LeU 


Asp 


Glv 


Ala 


ASD 


Cvs 


He 


Met 


Leu 


Ser Gly 


Glu Thr 


Ala Lys 


Gly 






355 










360 






365 






Asp 


Tvr 


Pro 


Leu 


Glu 


Ala 


Val 


Arg 


Met 


Gin His 


Leu He 


Ala Arg 


Glu 




370 










375 








380 






Ala 


Glu 


Ala 


Ala 


lie 


Tvr 


His 


Leu 


Gin 


Leu Phe 


Glu Glu 


Leu Arg 


Arg 


385 










390 








395 






400 


Leu 


Ala 


Pro 


lie 


Thr 


Ser 


Asp 


Pro 


Thr 


Glu Ala 


Thr Ala Val Gly 


Ala 










405 










410 




415 




Val 


Glu 


Ala 


Ser 


Phe 


Lvs 

J; 


Cvs 


Cvs 


Ser 


Gly Ala 


He He 


Val Leu 


Thr 








420 










425 






430 




Lys 


Sex* 


Glv 




Ser 


Ala 


His 


Gin 


Val 


Ala Arg 


Tyr Arg 


Pro Arg 


Ala 






435 










440 






445 






Pro 


lie 


lie 


Ala 


Val 


Thr 


Arcr 


Asn 


Pro 


Gin Thr 


Ala Arg 


Gin Ala 


His 




450 










455 








460 






Leu 


Tyr 


Arg 


Gly 


He 


Phe 


Pro 


Val 


Leu 


Cys Lys 


Asp Pro 


Val Gin 


Glu 


465 










470 








475 






480 


Ala 


Trp 


Ala 


Glu 


Asp 


Val 


Asp 


Leu 


Arg 


Val Asn 


Phe Ala 


Met Asn 


Val 










485 










490 




495 




Gly 


Lys 


Ala 


Arg 


Gly 


Phe 


Phe 


Lys 


Lys 


Gly Asp 


Val Val 


He Val 


Leu 








500 










505 






510 




Thr 


Gly 


Trp 


Arg 


Pro 


Gly 


Ser 


Gly 


Phe 


Thr Asn 


Thr Met 


Arg Val 


Val 



515 520 525 



Pro Val Pro 
530 

<210> 685 

<211> 480 

<212> PRT 

<213> Homo sapiens 



<400> 685 



Met 


Ala 


Ala 


Arg 


Cys 


Ser 


Thr 


Arg 


Trp 


Leu 


Leu 


Val 


Val 


Val 


Gly 


Thr 


1 








5 










10 










15 




Pro 


Arg 


Leu 


Pro 


Ala 


He 


Ser 


Gly 


Arg Gly Ala 


Arg 


Pro 


Pro 


Arg 


Glu 








20 










25 










30 






Gly 


Val 


Val 


Gly Ala 


Trp 


Leu 


Ser 


Arg 


l y s 


Leu 


Ser 


Val 


Pro 


Ala 


Phe 






35 










40. 










45 








Ala 


Ser 
50 


Ser 


Leu 


Thr 


Ser 


Cys 
55 


Gly 


Pro 


Arg 


Ala 


Leu 
60 


Leu 


Thr 


Leu 


Arg 


Pro 


Gly 


Val 


Ser Leu Thr Gly Thr 


Lys 


His 


Asn 


Pro 


Phe 


He 


Cys 


Thr 


65 










70 










75 










80 


Ala 


Ser 


Phe 


His 


Thr 


Ser 


Ala 


Pro 


Leu 


Ala 


Lys 


Glu Asp 


Tyr 


Tyr 


Gin 










85 










90 










95 




He 


Leu 


Gly 


Val 
100 


Pro 


Arg 


Asn 


Ala 


Ser 
105 


Gin 


Lys 


Glu 


He 


Lys 
110 


Lys 


Ala 


Tyr 


Tyr 


Gin 
115 


Leu 


Ala 


Lys 


Lys 


Tyr 
120 


His 


Pro 


Asp 


Thr 


Asn 
125 


Lys 


Asp 


Asp 


Pro 


Lys 
130 


Ala 


Lys 


Glu 


Lys 


Phe 
135 


Ser 


Gin 


Leu 


Ala 


Glu 
14 0 


Ala 


Tyr 


Glu 


Val 


Leu 


Ser 


Asp 


Glu 


Val 


Lys 


Arg 


Lys 


Gin Tyr Asp 


Ala 


Tyr 


Gly Ser 


Ala 


145 










150 










155 










160 


Gly 


Phe 


Asp 


Pro 


Gly Ala 


Ser Gly 


Ser 


Gin 


His 


Ser 


Tyr 


Trp 


Lys 


Gly 








165 










170 










175 
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Gly 


Pro 


Thr 


Val 


Asp 


Pro 


Glu Glu Leu 








180 






185 


Phe 


Ser 


Ser 


Ser 


Ser 


Phe 


Gly Asp Phe 






195 








200 


Gin 


Glu 


Tyr 


Phe 


Met 


Glu 


Leu Thr Phe 




210 










215 


Asn 


Lys 


Glu 


Phe 


Thr 


Val 


Asn He Met 


225 










230 




Gly 


Lys 


Gly Asn Glu 


Pro 


Gly Thr Lys 










245 






Gly 


Gly 


Ser 


Gly 


Met 


Glu 


Thr He Asn 








260 






265 


Ser 


Thr 


Cys 


Arg 


Arg 


Cys 


Gly Gly Arg 






275 








280 


Cys 


Val 


Val 


Cys 


Arg 


Gly 


Ala Gly Gin 




290 










295 


Met 


He 


Pro 


Val 


Pro 


Ala 


Gly Val Glu 


305 










310 




Pro 


Val 


Gly Lys 


Arg 


Glu 


He Phe He 










325 






Pro 


Val 


Phe 


Arg 


Arg 


Asp 


Gly Ala Asp 








340 






345 


Ser 


He 


Ala 


Gin 


Ala 


Leu 


Leu Gly Gly 






355 








360 


Tyr 
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Tyr Arg Asp Phe Ser Leu Arg Met Glu Lys Tyr Arg Thr Ser Leu Glu 

1170 1175 1180 

Lys Ala Leu Gly lie Ser Ser Asp Ser Asn Lys Ser Ser Lys Ser Leu 
1185 1190 1195 1200 

Gin Leu Asp lie lie Pro Ala Ser lie Pro Gly Ser Glu Val Lys Leu 

1205 1210 1215 

Arg Asp Ala Ala His Glu Leu Asn Glu Glu Lys Arg Lys Ser Ala Arg 

1220 1225 1230 

Arg Lys Glu Phe lie Met Ala Glu Leu He Gin Thr Glu Lys Ala Tyr 

1235 1240 1245 

Val Arg Asp Leu Arg Glu Cys Met Asp Thr Tyr Leu Trp Glu Met Thr 

1250 1255 1260 

Ser Gly Val Glu Glu He Pro Pro Gly He Val Asn Lys Glu Leu He 
1265 1270 1275 1280 

He Phe Gly Asn Met Gin Glu He Tyr Glu Phe His Asn Asn He Phe 

1285 1290 1295 

Leu Lys Glu Leu Glu Lys Tyr Glu Gin Leu Pro Glu Asp Val Gly His 

1300 1305 1310 

Cys Phe Val Thr Trp Ala Asp Lys Phe Gin Met Tyr Val Thr Tyr Cys 

1315 1320 1325 

Lys Asn Lys Pro Asp Ser Thr Gin Leu He Leu Glu His Ala Gly Ser 

1330 1335 1340 

Tyr Phe Asp Glu He Gin Gin Arg His Gly Leu Ala Asn Ser He Ser 
1345 *" 1350 1355 1360 

Ser Tyr Leu He Lys Pro Val Gin Arg He Thr Lys Tyr Gin Leu Leu 

13 65 1370 1375 

Leu Lys Glu Leu Leu Thr Cys Cys Glu Glu Gly Lys Gly Glu He Lys 

1380 1385 1390 

Asp Gly Leu Glu Val Met Leu Ser Val Pro Lys Arg Ala Asn Asp Ala 

1395 1400 1405 

Met His Leu Ser Met Leu Glu Gly Phe Asp Glu Asn He Glu Ser Gin 

1410 1415 1420 

Gly Glu Leu He Leu Gin Glu Ser Phe Gin Val Trp Asp Pro Lys Thr 
1425 1430 1435 1440 

Leu He Arg Lys Gly Arg Glu Arg His Leu Phe Leu Phe Glu Met Ser 

1445 1450 1455 

Leu Val Phe Ser Lys Glu Val Lys Asp Ser Ser Gly Arg Ser Lys Tyr 

1460 1465 1470 

Leu Tyr Lys Ser Lys Leu Phe Thr Ser Glu Leu Gly Val Thr Glu His 

1475 1480 1485 

Val Glu Gly Asp Pro Cys Lys Phe Ala Leu Trp Val Gly Arg Thr Pro 

1490 1495 1500 

Thr Ser Asp Asn Lys He Val Leu Lys Ala Ser Ser He Glu Asn Lys 
1505 1510 1515 1520 

Gin Asp Trp He Lys His He Arg Glu Val He Gin Glu Arg Thr He 

1525 1530 1535 

His Leu Lys Gly Ala Leu Lys Glu Pro He His He Pro Lys Thr Ala 

1540 1545 1550 

Pro Ala Thr Arg Gin Lys Gly Arg Arg Asp Gly Glu Asp Leu Asp Ser 

1555 1560 1565 

Gin Gly Asp Gly Ser Ser Gin Pro Asp Thr He Ser He Ala Ser Arg 

1570 1575 1580 

Thr Ser Gin Asn Thr Leu Asp Ser Asp Lys Leu Ser Gly Gly Cys Glu 
1585 1590 1595 1600 
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Leu Thr Val Val He His Asp Phe Thr Ala Cys Asn Ser Asn Glu Leu 

1605 1610 1615 

Thr He Arg Arg Gly Gin Thr Val Glu Val Leu Glu Arg Pro His Asp 

1620 1625 1630 

Lys Pro Asp Trp Cys Leu Val Arg Thr Thr Asp Arg Ser Pro Ala Ala 

1635 1640 1645 

Glu Gly Leu Val Pro Cys Gly Ser Leu Cys He Ala His Ser Arg Ser 

1650 1655 1660 

Ser Met Glu Met Glu Gly He Phe Asn His Lys Asp Ser Leu Ser Val 
1665 1670 1675 168i 

Ser Ser Asn Asp Ala Ser Pro Pro Ala Ser Val Ala Ser Leu Gin Pro 

1685 1690 1695 

His Met He Gly Ala Gin Ser Ser Pro Gly Pro Lys Arg Pro Gly Asn 

1700 1705 1710 

Thr Leu Arg Lys Trp Leu Thr Ser Pro Val Arg Arg Leu Ser Ser Gly 

1715 1720 1725 

Lys Ala Asp Gly His Val Lys Lys Leu Ala His Lys His Lys Lys Ser 

1730 1735 1740 

Arg Glu Val Arg Lys Ser Ala Asp Ala Gly Ser Gin Lys Asp Ser Asp 
1745 1750 1755 1761 

Asp Ser Ala Ala Thr Pro Gin Asp Glu Thr Val Glu Glu Arg Gly Arg 

1765 1770 1775 

Asn Glu Gly Leu Ser Ser Gly Thr Leu Ser Lys Ser Ser Ser Ser Gly 

1780 1785 1790 

Met Gin Ser Cys Gly Glu Glu Glu Gly Glu Glu Gly Ala Asp Ala Val 

1795 1800 1805 

Pro Leu Pro Pro Pro Met Ala lie Gin Gin His Ser Leu Leu Gin Pro 

1810 1815 1820 

Asp Ser Gin Asp Asp Lys Ala Ser Ser Arg Leu Leu Val Arg Pro Thr 
1825 1830 1835 184( 

Ser Ser Glu Thr Pro Ser Ala Ala Glu Leu Val Ser Ala He Glu Glu 

1845 1850 1855 

Leu Val Lys Ser Lys Met Ala Leu Glu Asp Arg Pro Ser Ser Leu Leu 

1860 1865 1870 

Val Asp Gin Gly Asp Ser Ser Ser Pro Ser Phe Asn Pro Ser Asp Asn 

1875 1880 1885 

Ser Leu Leu Ser Ser Ser Ser Pro He Asp Glu Met Glu Glu Arg Lys 

1890 1895 1900 

Ser Ser Ser Leu Lys Arg Arg His Tyr Val Leu Gin Glu Leu Val Glu 
1905 1910 1915 192( 

Thr Glu Arg Asp Tyr Val Arg Asp Leu Gly Tyr Val Val Glu Gly Tyr 

1925 1930 1935 

Met Ala Leu Met Lys Glu Asp Gly Val Pro Asp Asp Met Lys Gly Lys 

1940 1945 1950 

Asp Lys He Val Phe Gly Asn He His Gin He Tyr Asp Trp His Arg 

1955 1960 1965 

Asp Phe Phe Leu Gly Glu Leu Glu Lys Cys Leu Glu Asp Pro Glu Lys 

1970 1975 1980 

Leu Gly Ser Leu Phe Val Lys His Glu Arg Arg Leu His Met Tyr He 
1985 1990 1995 200C 

Ala Tyr Cys Gin Asn Lys Pro Lys Ser Glu His He Val Ser Glu Tyr 

2005 2010 2015 

He Asp Thr Phe Phe Glu Asp Leu Lys Gin Arg Leu Gly His Arg Leu 



2020 



2025 



2030 
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Gin Leu Thr Asp Leu Leu lie Lys Pro Val Gin Arg He Met Lys Tyr 

2035 2040 2045 

Gin Leu Leu Leu Lys Asp Phe Leu Lys Tyr Ser Lys Lys Ala Ser Leu 

2050 ~ 2055 2060 

Asp Thr Ser Glu Leu Glu Arg Ala Val Glu Val Met Cys He Val Pro 
2065 2070 2075 208C 

Arg Arg Cys Asn Asp Met Met Asn Val Gly Arg Leu Gin Gly Phe Asp 

2085 2090 2095 

Gly Lys He Val Ala Gin Gly Lys Leu Leu Leu Gin Asp Thr Phe Leu 

2100 ~ 2105 2110 

Val Thr Asp Gin Asp Ala Gly Leu Leu Pro Arg Cys Arg Glu Arg Arg 

2115 2120 2125 

He Phe Leu Phe Glu Gin He Val He Phe Ser Glu Pro Leu Asp Lys 

2130 2135 2140 

Lys Lys Gly Phe Ser Met Pro Gly Phe Leu Phe Lys Asn Ser He Lys 
2145 2150 " 2155 216( 

Val Ser Cys Leu Cys Leu Glu Glu Asn Val Glu Asn Asp Pro Cys Lys 

2165 2170 2175 

Phe Ala Leu Thr Ser Arg Thr Gly Asp Val Val Glu Thr Phe He Leu 

2180 2185 2190 

His Ser Ser Ser Pro Ser Val Arg Gin Thr Trp He His Glu He Asn 

2195 2200 2205 

Gin He Leu Glu Asn Gin Arg Asn Phe Leu Asn Ala Leu Thr Ser Pro 

2210 2215 2220 

He Glu Tyr Gin Arg Asn His Ser Gly Gly Gly Gly Gly Gly Gly Ser 
2225 2230 2235 224( 

Gly Ala Ala Ala Gly Val Gly Ala Ala Ala Ala Ala Gly Pro Pro Val 

2245 2250 2255 

Ala Ala Ala Ala Thr Val Ala Ala Pro Ala Ala Ala Ala Ala Pro Pro 

2260 2265 2270 

Ala Arg Ala Gly Ala Gly Pro Pro Gly Ser Pro Ser Leu Ser Asp Thr 

2275 2280 2285 

Thr Pro Pro Cys Trp Ser Pro Leu Gin Pro Arg Ala Arg Gin Arg Gin 

2290 2295 2300 

Thr Arg Cys Gin Ser Glu Ser Ser Ser Ser Ser Asn He Ser Thr Met 
2305 ' J 2310 2315 232( 

Leu Val Thr His Asp Tyr Thr Ala Val Lys Glu Asp Glu He Asn Val 

2325 2330 2335 

Tyr Gin Gly Glu Val Val Gin He Leu Ala Ser Asn Gin Gin Asn Met 

2340 2345 2350 

Phe Leu Val Phe Arg Ala Ala Thr Asp Gin Cys Pro Ala Ala Glu Gly 

2355 2360 2365 

Trp He Pro Gly Phe Val Leu Gly His Thr Ser Ala Val He Val Glu 

2370 2375 2380 

Asn Pro Asp Gly Thr Leu Lys Lys Ser Thr Ser Trp His Thr Ala Leu 
2385 2390 2395 240< 

Arg Leu Arg Lys Lys Ser Glu Lys Lys Asp Lys Asp Gly Lys Arg Glu 

2405 2410 2415 

Gly Lys Leu Glu Asn Gly Tyr Arg Lys Ser Arg Glu Gly Leu Ser Asn 

2420 2425 2430 

Lys Val Ser Val Lys Leu Leu Asn Pro Asn Tyr He Tyr Asp Val Pro 

2435 2440 2445 

Pro Glu Phe Val He Pro Leu Ser Glu Val Thr Cys Glu Thr Gly Glu 
2450 2455 2460 
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Thr Val Val Leu Arg Cys Arg Val Cys Gly Arg Pro Lys Ala Ser He 
2465 2470 2475 2481 

Thr Trp Lys Gly Pro Glu His Asn Thr Leu Asn Asn Asp Gly His Tyr 

2485 2490 2495 

Ser He Ser Tyr Ser Asp Leu Gly Glu Ala Thr Leu Lys He Val Gly 

2500 2505 2510 

Val Thr Thr Glu Asp Asp Gly He Tyr Thr Cys He Ala Val Asn Asp 

2515 2520 2525 

Met Gly Ser Ala Ser Ser Ser Ala Ser Leu Arg Val Leu Gly Pro Gly 

2530 2535 2540 

Met Asp Gly He Met Val Thr Trp Lys Asp Asn Phe Asp Ser Phe Tyr 
2545 2550 2555 2561 

Ser Glu Val Ala Glu Leu Gly Arg Gly Arg Phe Ser Val Val Lys Lys 

2565 2570 2575 

Cys Asp Gin Lys Gly Thr Lys Arg Ala Val Ala Thr Lys Phe Val Asn 

2580 2585 2590 

Lys Lys Leu Met Lys Arg Asp Gin Val Thr His Glu Leu Gly He Leu 

2595 2600 2605 

Gin Ser Leu Gin His Pro Leu Leu Val Gly Leu Leu Asp Thr Phe Glu 

2610 2615 2620 

Thr Pro Thr Ser Tyr He Leu Val Leu Glu Met Ala Asp Gin Gly Arg 
2625 2630 2635 264( 

Leu Leu Asp Cys Val Val Arg Trp Gly Ser Leu Thr Glu Gly Lys He 

2645 2650 2655 

Arg Ala His Leu Gly Glu Val Leu Glu Ala Val Arg Tyr Leu His Asn 

2660 2665 2670 

Cys Arg He Ala His Leu Asp Leu Lys Pro Glu Asn He Leu Val Asp 

2675 2680 * 2685 

Glu Ser Leu Ala Lys Pro Thr He Lys Leu Ala Asp Phe Gly Asp Ala 

2690 2695 2700 

Val Gin Leu Asn Thr Thr Tyr Tyr He His Gin Leu Leu Gly Asn Pro 
2705 2710 2715 272( 

Glu Phe Ala Ala Pro Glu He He Leu Gly Asn Pro Val Ser Leu Thr 

2725 2730 2735 

Ser Asp Thr Trp Ser Val Gly Val Leu Thr Tyr Val Leu Leu Ser Gly 

2740 2745 2750 

Val Ser Pro Phe Leu Asp Asp Ser Val Glu Glu Thr Cys Leu Asn He 

2755 2760 2765 

Cys Arg Leu Asp Phe Ser Phe Pro Asp Asp Tyr Phe Lys Gly Val Ser 

2770 2775 2780 

Gin Lys Ala Lys Glu Phe Val Cys Phe Leu Leu Gin Glu Asp Pro Ala 
2785 2790 2795 280C 

Lys Arg Pro Ser Ala Ala Leu Ala Leu Gin Glu Gin Trp Leu Gin Ala 

2805 2810 2815 

Gly Asn Gly Arg Ser Thr Gly Val Leu Asp Thr Ser Arg Leu Thr Ser 

2820 2825 2830 

Phe He Glu Arg Arg Lys His Gin Asn Asp Val Arg Pro He Arg Ser 

2835 2840 2845 

He Lys Asn Phe Leu Gin Ser Arg Leu Leu Pro Arg Val 
2850 2855 2860 

<210> 688 
<211> 271 
<212> PRT 



-258- 



BNSDOCID: <WO 02092000A2_I_> 




WO 02/092000 
<213> Homo sapiens 



<400> 688 



Met 


Val 


Leu 


He 


Lys 


Glu 


Phe 


Arg 


Val 


1 








5 










Glu 


Tyr 


Gin 


Val 


Gly Gin Leu Tyr 


Ser 








20 










25 


Glu 


Thr 


Gly Gly 


Gly Glu Gly 


He 


Glu 






35 










40 




Glu 


Lys 


Asp 


Gly 


Glu 


Lys 


Gly Gin 


Tyr 




50 










55 






Lys 


Ser 


Lys 


Val 


Pro 


Ala 


Phe 


Val 


Arg 


65 










70 








Leu 


Val 


Phe 


His 


Glu 
85 


Lys 


Ala 


Trp 


Asn 


lie 


Val 


Thr 


Asn 


Glu 


Tyr Met 


Lys 


Asp 








100 










105 


Thr 


Trp 


His 
115 


Lys 


Pro 


Asp 


Leu 


Gly 
120 


Thr 


Asp 


Pro 
130 


Asn 


Thr 


Trp 


Lys 


Thr 
135 


Val 


Glu 


Asp 


Arg 


Ser 


Gin 


Val 


Glu 


Pro 


Ala 


Asp 


145 










150 








Ala 


Leu 


Phe 


Gin 


Ser 
165 


Val 


Lys 


Thr 


Lys 


Trp 


Lys 


Lys 


Glu 
180 


Leu 


Ala 


Asn 


Ser 


Pro 
185 


Tyr 


Lys 


Leu 
195 


Val 


Thr 


He 


Lys 


Phe 
200 


Lys 


Val 


Glu 
210 


Asn 


Phe 


He 


Gin 


Lys 
215 


Gin 


Glu 


His 


Arg 


Gin 


Leu 


Phe 


Cys 


Trp 


He 


Asp 


225 










230 








Glu 


Asp 


lie 


Arg 


Arg 


Met 


Glu Asp 


Glu 










245 










Met 


Arg 


Lys 


Arg 
260 


Gly 


Ser 


Val 


Arg 


Gly 
265 



<210> 642 
<211> 11 
<212> DMA 

<213> Homo sapiens 

<400> 642 
gatgatgatg a 

<210> 643 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 643 
gatgatgatg a 




PCT/US02/14877 



Val 


Leu 


Pro 


Cys 


Ser 


Val 


Gin 


10 










15 




Val 


Ala 


Glu 


Ala 


Ser 


Lys 


Asn 










30 






Val 


Leu 


Lys 


Asn 


Glu 


Pro 


Tyr 








45 








Thr 


His 


Lys 


He 


Tyr 


His 


Leu 






60 










Met 


He 


Ala 


Pro 


Glu 


Gly 


Ser 




75 










80 


Ala 


Tyr 


Pro 


Tyr 


Cys 


Arg 


Thr 


90 










95 




Asp 


Phe 


Phe 


He 


Lys 


He 


Glu 










110 






Leu 


Glu 


Asn 


Val 


His 


Gly 


Leu 








125 








He 


Val 


His 


He 


Asp 


He 


Ala 






14 0 










Tyr 


Lys 


Ala 


Asp 


Glu 


Asp 


Pro 




15.5 










160 


Arg 


Gly Pro 


Leu 


Gly 


Pro 


Asn 


170 










175 




Asp 


Cys 


Pro 


Gin 


Met 


Cys 


Ala 










190 






Trp 


Trp Gly Leu 


Gin 


Ser 


Lys 








205 








Lys 


Arg 


He 


Phe 


Thr 


Asn 


Phe 






220 










Lys 


Trp 


He 


Asp 


Leu 


Thr 


Met 




235 










240 


Thr 


Gin 


Lys 


Glu 


Leu 


Glu 


Thr 


250 










255 




Thr 


Ser 


Ala 


Ala 


Asp 


Val 





270 



11 



11 

-259- 



BNSDOC1D: <WO 02092000A2J_> 





WO 02/092000 



PCT/US02/14877 



<210> 644 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 644 

gatgatgatg a 11 

<210> 645 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 645 

gatgatgatg a 11 

<210> 646 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 646 

gatgatgatg a 11 

<210> 647 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 647 

gatgatgatg a 11 

<210> 648 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 648 

gatgatgatg a 11 

<210> 649 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 649 

gatgatgatg a 11 

<210> 650 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 650 



-260- 



BNSDOCID: <WO 02092000 A2_l_> 




WO 02/092000 



PCT/US02/14877 



gatgatgatg a 

<210> 651 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 651 
gatgatgatg a 

<210> 652 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 652 
gatgatgatg a 

<210> 653 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 653 
gatgatgatg a 

<210> 654 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 654 
gatgatgatg a 

<210> 655 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 655 
gatgatgatg a 

<210> 656 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 656 
gatgatgatg a 

<210> 657 

<211> 11 

<212> DNA 

<213> Homo sapiens 



-261- 



BNSDOCID: <WO 



.02092000A2J_> 





WO 02/092000 



PCT7US02/14877 



<400> 657 
gatgatgatg a 



11 



<210> 658 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 658 

gatgatgatg a 11 

<210> 659 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 659 

gatgatgatg a 11 

<210> 660 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 660 

gatgatgatg a 11 

<210> 661 
<211> 11 
<212> DNA 

<213> Homo sapiens 
•<400> 661 

gatgatgatg a 11 

<210> 662 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 662 

gatgatgatg a 11 

<210> 663 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 663 

gatgatgatg a 11 

<210> 664 
<211> 11 
<212> DNA 



-262- 



BNSDOCID: <WO. 



02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



<213> Homo sapiens 

<400> 664 
gatgatgatg a 

<210> 665 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 665 
gatgatgatg a 

<210> 666 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 666 
gatgatgatg a 

<210> 667 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 667 
gatgatgatg a 

<210> 668 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 668 
gatgatgatg a 

<210> 669 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 669 
gatgatgatg a 

<210> 670 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 670 
gatgatgatg a 

<210> 671 



-263- 



BNSDOCID: <WO. 



02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 671 
gatgatgatg a 



11 



<210> 672 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 672 

gatgatgatg a 11 

<210> 673 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 673 

gatgatgatg a 11 

<210> 674 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 674 

gatgatgatg a 11 

<210> 675 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 675 

gatgatgatg a 11 

<210> 676 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 676 

gatgatgatg a 11 

<210> 677 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 677 

gatgatgatg a 11 



-264- 



BNSDOC1D: <WO 02092000 A2_l_> 





WO 02/092000 



PCT/US02/14877 



<210> 678 
<211> 11 
<212> DNA 

<213> Homo sapiens 

<400> 678 
gatgatgatg a 

<210> 679 
<211> 11 
<212> DNA 

<213> Homo sapiens 

<400> 679 
gatgatgatg a 

<210> 680 
<211> 11 
<212> DNA 

<213> Homo sapiens 

<400> 680 
gatgatgatg a 

<210> 681 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 681 
gatgatgatg a 

<210> 682 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 682 
gatgatgatg a 

<210> 683 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 683 
gatgatgatg a 

<210> 684 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 684 



-265- 



BNSDOCID: <WO. 



02092000A2_I_> 





WO 02/092000 



PCT/US02/14877 



gatgatgatg a 



11 



<210> 685 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 685 

gatgatgatg a 11 

<210> 686 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 686 

gatgatgatg a 11 

<210> 687 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 687 

gatgatgatg a 11 

<210> 688 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 688 

gatgatgatg a 11 

<210> 689 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 689 

gtcagcctgg aggagttctc a 21 

<210> 690 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 690 

tcacccttgg caatacagat gt 22 

<210> 691 

<211> 21 

<212> DNA 

<213> Homo sapiens 



-266- 



BNSDOCID: <WO 02092000A2_I_> 




WO 02/092000 



<400> 691 

cccacccatg tgcccgtgac a 

<210> 692 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 692 

cgtgattgcc gacgatctc 

<210> 693 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 693 

ttccggccgc tagtcttgt 

<210> 694 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 694 

cgcacccgtt cggtctgacg cagtac 

<210> 695 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 695 

ctttccccac gagtatgttg gt 

<210> 696 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 696 

aagggaccgt gctgtgagc 

<210> 697 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 697 

agcccctcat gtgcctctca acttcatag 

<210> 698 
<211> 6 
<212> PRT 
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21 



19 



19 



26 



22 



19 



29 
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<213> Artificial Sequence 
<220> 

<223> partial signal sequence of HBM and Zmaxl 
<400> 698 

Ser Pro Ala His Ser Ser 
1 5 



<210> 699 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 699 

tcgaccacca tggaggcagc gccgc 25 

<210> 700 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 700 

ggtggtacct ccgtcgcggc gggcc 25 

<210> 701 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 701 

catgtgtgaa atcaccaagc cgccctcaga cgacagcccg gcccacagca gtggc 55 

<210> 702 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic duplex 
<400> 702 

tcgagtacac actttagtgg ttcggcggga gtctgctgtc gggccgggtg tcgtcaccgc 60 
egg 63 
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<210> 703 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 703 

Val Glu Thr Pro Arg He Glu Arg Ala Gly Met Asp Gly Ser Thr Arg 

15 10 15 

Lys 



<210> 704 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 704 

Asn Lys Arg Thr Gly Gly Lys Arg Lys Glu He Leu Ser Ala Leu 



<210> 705 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 705 

Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu 
15 10 



<210> 706 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 706 

Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys Leu Pro 
15 10 15 



1 



5 



10 



15 
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<210> 707 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 707 

Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gin Cys Gly Gin 
1 5 10 15 



<210> 708 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 708 

Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn 
15 10 



<210> 709 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 709 

. Tyr Ala Gly Ala Asn Gly Pro Phe Pro His Glu Tyr Val Ser Gly 
15 10 15 



<210> 710 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 710 

Ala Cys Gly Lys Ser Met Met Ser Ser Val Ser Leu Met 
15 10 



<210> 711 
<211> 18 



-270- 



BNSDOCID: <WO 02092000A2J_> 





WO 02/092000 



PCT/US02/14877 



<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 711 

Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser Asp Ser Asp 

1 ~ 5 10 15 

Pro Tyr 



<210> 712 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 712 

Ser Gly Trp Asn Glu Cys Ala Ser Ser Asn Gly His Cys Ser His 
1 5 10 15 



<210> 713 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide from Zmaxl sequence (SEQ ID NO: 3) 
<400> 713 

Asn Gly Asp Ala Asn Cys Gin Asp Lys Ser Asp Glu Lys Asn 
15 10 



<210> 714 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 714 

lie Lys Arg Ala Lys Asp Asp Gly Thr Gin Pro Phe Val Leu Thr Ser 

1 * ~ 5 10 15 

Leu Ser Gin Gly Gin Asn 
20 



<210> 715 
<211> 38 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> primer 



<400> 715 

cgtacgtaaa gcggccgctt ggcaatacag atgtggga 



38 



<210> 716 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223 > primer 
<400> 716 

cgtacgtaaa gcggccgcag tagctcctct cggtggc 37 

<210> 717 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 717 

cgtacgtaaa gcggccgcgc tcatcatgga ctttccg 37 

<210> 718 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 718 

cgtacgtaaa gcggccgcgc actgctgttt gatgagg 37 

<210> 719 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 719 

cgtacgtaaa gcggccgcga gtgtggaaga aaggctgc 3 8 

<210> 720 
<211> 38 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 720 

cgtacgtaaa gcggccgcag tagagcttcc cctcctgc 

<210> 721 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 721 

cgtacgtaaa gcggccgcgt ccatcacgaa gtccaggt 

<210> 722 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 722 

cagcggcctg gaggatgc 

<210> 723 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 723 

cggtccagta gaggtttcg 

<210> 724 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 724 

catcagccgc gccttcatg 
<210> 725 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 725 

cctgcatgtt ggtgaagtac 2 0 

<210> 726 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 726 

gcccgaaacc tctactggac cgac 24 

<210> 727 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 727 

gcccacccca tcacagttca catt 24 

<210> 728 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 728 

cagcggcctg gaggatgc 18 

<210> 729 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 729 

cgggtacatg tactggacag ctgattagc 2 9 
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<210> 730 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 730 

gtcagcctgg aggagttctc a 

<210> 731 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 731 

tcacccttgg caatacagat gt 

<210> 732 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 732 

cccacccatg tgcccgtgac a 

<210> 733 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 733 

cgtgattgcc gacgatctc 

<210> 734 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 734 

ttccggccgc tagtcttgt 

<210> 735 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 735 

cgcacccgtt cggtctgacg cagtac 

<210> 736 
<211> 22 
<212> DNA 
<213> Mouse 

<400> 736 
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ctttccccac gagtatgttg gt 



22 



<210> 737 
<211> 19 
<212> DNA 
<213> Mouse 

<400> 737 

aagggaccgt gctgtgagc 19 

<210> 738 
<211> 29 
<212> DNA 
<213> Mouse 

<400> 738 

agcccctcat gtgcctctca acttcatag 29 

<210> 739 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 739 

gagcgggcag ggatggatgg 2 0 

<210> 740 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 740 

aggttggcac ggtggatgaa gc 22 

<210> 741 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
. <400> 741 

aagcttgttt aaactgggca tggtggcaca tggttgtaat 4 0 

<210> 742 
<211> 34 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> primer 



<400> 742 

gggcttccac ccagtcagtc cagtacatgt acct 



34 



<210> 743 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 743 

ctgactgggt ggaagcaccc cggatcgagc 3 0 

<210> 744 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 744 

gaattcatcg gtacctgtgc ggccgcttca ttg 3 3 

<210> 745 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> primer 
<400> 745 

tgagatgtcc tgtctgtggc 20 

<210> 746 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 746 

tccttccttc cctacagttg 20 

<210> 747 
<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 747 

cctaaggatc tccttgtgtc tgtgg 25 

<210> 748 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 748 

ctgcagcagg tcagtagcct gc 22 

<210> 749 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 749 

tgagatgtcc tgtctgtggc 2 0 

<210> 750 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 750 

tccttccttc cctacagttg 20 

<210> 751 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 751 

gcttctggcg tgtgaccggc g 21 



<210> 752 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 752 

gccgcacagc gccagcagca gc 22 

<210> 753 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 753 

cacccacgcc ccacagccag ta 22 

<210> 754 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> primer 
<400> 754 

atttgccctc ccatatgtcc ttcc 24 

<210> 755 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 755 

ttcctcccag ccctcctcca tcag 24 

<210> 756 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 756 

gccgcacagc gccagcagca gc 22 
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<210> 757 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> primer 
<400> 757 

gaatggcgcc cccgacgac 19 

<210> 758 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> primer 
<400> 758 

gctcccattc atcagttcca tagg 24 

<210> 759 
<211> 5278 
<212> DNA 
<213> Mouse 

<400> 759 

tctagactcg aggcggccgc ccattgtgca ctaaagcgcc gccgccgcgc catggagccc 60 

gagtgagcgc ggcgcgggcc cgtccggccg ccggacaaca tggaggcagc gccgcccggg 12 0 

ccgccgtggc cgctgctgct gctgctgctg ctgctgctgg cgctgtgcgg ctgcccggcc 180 

cccgccgcgg cctcgccgct cctgctattt gccaaccgcc gggacgtacg gctggtggac 24 0 

gccggcggag tcaagctgga gtccaccatc gtggtcagcg gcctggagga tgcggccgca 3 00 

gtggacttcc agttttccaa gggagccgtg tactggacag acgtgagcga ggaggccatc 360 

aagcagacct acctgaacca gacgggggcc gccgtgcaga acgtggtcat ctccggcctg 420 

gtctctcccg acggcctcgc ctgcgactgg gtgggcaaga agctgtactg gacggactca 4 80 

gagaccaacc gcatcgaggt ggccaacctc aatggcacat cccggaaggt gctcttctgg 54 0 

caggaccttg accagccgag ggccatcgcc ttggaccccg ctcacgggta catgtactgg 600 

acagactggg ttgagacgcc ccggattgag cgggcaggga tggatggcag cacccggaag 660 

atcattgtgg actcggacat ttactggccc aatggactga ccatcgacct ggaggagcag 72 0 

aagctctact gggctgacgc caagctcagc ttcatccacc gtgccaacct ggacggctcg 78 0 

ttccggcaga aggtggtgga gggcagcctg acgcacccct tcgccctgac gctctccggg 84 0 

gacactctgt actggacaga ctggcagacc cgctccatcc atgcctgcaa caagcgcact 90 0 

ggggggaaga ggaaggagat cctgagtgcc ctctactcac ccatggacat ccaggtgctg 960 

agccaggagc ggcagccttt cttccacact cgctgtgagg aggacaatgg cggctgctcc 102 0 

cacctgtgcc tgctgtcccc aagcgagcct ttctacacat gcgcctgccc cacgggtgtg 10 8 0 

cagctgcagg acaacggcag gacgtgtaag gcaggagccg aggaggtgct gctgctggcc 114 0 

cggcggacgg acctacggag gatctcgctg gacacgccgg acttcaccga catcgtgctg 12 00 

caggtggacg acatccggca cgccattgcc atcgactacg acccgctaga gggctatgtc 1260 

tactggacag atgacgaggt gcgggccatc cgcagggcgt acctggacgg gtctggggcg 132 0 

cagacgctgg tcaacaccga gatcaacgac cccgatggca tcgcggtcga ctgggtggcc 13 8 0 

cgaaacctct actggaccga cacgggcacg gaccgcatcg aggtgacgcg cctcaacggc 144 0 

acctcccgca agatcctggt gtcggaggac ctggacgagc cccgagccat cgcactgcac 15 0 0 

cccgtgatgg gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 1560 
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aacttggatg 
ctggccctgg 
gaggtgatca 
attttcgggt 
atcgagcggg 
ctgatggggc 
aggaacgggg 
cccatcggcc 
gtcttcacca 
gccatcccgc 
cacatctact 
tcggtggagc 
atgggcaaga 
gacgggcagt 
ctggatccca 
cgggccttca 
gacctcacca 
atcgagtcgt 
ccgttcggtc 
attgagcggg 
ttcgtgatgg 
cacaacaacg 
tgcgcctcac 
ttgctgttca 
gatctcatcc 
gacaagttca 
acccagccct 
gacctcagca 
atcaacgtcc 
aagcccaggg 
gaccgggcag 
accaccggcc 
tgggtggacg 
accctggagg 
tactggatcg 
cggactcgca 
agcctggagg 
tgtattgcca 
cagaacctgc 
acaggggaga 
gaccagagcg 
ggtcagtgtg 
gacgaggcgg 
tgtgtcctca 
ctcatgtgtg 
gggcccgtca 
cagcgcgtgg 
agcgggaccc 
ttcacaggca 
ggcggggtgc 
agcacgaagg 
ccctccctgt 
tacaggccct 
tgtgacagcg 



ggcaggagcg 
acctgcagga 
atgttgatgg 
tcacgctgct 
tgcacaaggt 
tcaaagctgt 
ggtgcagcca 
tggagctgct 
gcagagccgc 
tcacgggcgt 
ggacagacgt 
acgtggtgga 
acctctactg 
tccggcaagt 
ccaagggcta 
tggacgggac 
ttgactacgc 
ccaacatgct 
tgacgcagta 
ccgacaagac 
acatcctggt 
ggcagtgtgg 
actacaccct 
gccagaaatc 
tgcccctgca 
tctactgggt 
ttgttttgac 
tcgacatcta 
acaggctgag 
ccatcgtcgt 
ccaagatcga 
tcatccgccc 
cggacctgaa 
acgccaacat 
accgccagca 
tccagggccg 
agttctcagc 
agggtgatgg 
tgacctgtgg 
tcgactgtat 
acgaggaggg 
tggacctgcg 
actgtgacgc 
tcaaacagca 
aaatcaccaa 
ttggcatcat 
tgtgccagcg 
cgcacgtgcc 
tcgcatgcgg 
ccctctacga 
ccacgctgta 
acaacatgga 
acatcattcg 
actacagcgc 



gcgtgtgctg 
ggggaagctc 
gacgaagagg 

gggggacttc 

caaggccagc 
gaatgtggcc 
cctgtgcttc 
gagtgacatg 
catccacagg 
caaggaggcc 
cagcctgaag 
gtttggcctt 
ggccgacact 
cctcgtgtgg 
catctactgg 
caactgcatg 
tgaccagcgc 
gggtcaggag 
cagcgattat 
tagcggccgg 
gttccactcc 
gcagctgtgc 
ggaccccagc 
tgccatcagt 
tggactgagg 
ggatgggcgc 
ctctctgagc 
cagccggaca 
cggggaagcc 
caacgcggag 
acgcgcagcc 

tgtggccctg 

gcgcattgag 
cgtgcagcct 
gcagatgatc 
tgtcgcccac 
ccacccatgt 
gacaccacgg 
agagccgccc 
ccccggggcc 
ctgccccgtg 
cctgcgctgc 
catctgcctg 
gtgcgactcc 
gccgccctca 
cctctctctc 
ctatgcgggg 
cctcaatttc 
aaagtccatg 
ccggaaccac 
cccgccgatc 
catgttctac 
aggaatggcg 
cagccgctgg 



gtcaatgcct 
tactggggag 
cggaccctcc 
atctactgga 
cgggacgtca 
aaggtcgtcg 
ttcacacccc 
aagacctgca 
atctccctcg 
tcagccctgg 
accatcagcc 
gactaccccg 
gggaccaaca 
agggacttgg 
accgagtggg 
acgctggtgg 
ctctactgga 
cgggtcgtga 
atctactgga 
aaccgcaccc 
tcccgccagg 
cttgccatcc 
agccgcaact 
cggatgatcc 
aacgtcaaag 
cagaacatca 
caaggccaaa 
ctgttctgga 
atgggggtgg 
cgagggtacc 
ctggacggca 
gtggtagaca 
agctgtgacc 
ctgggcctga 
gagcgtgtgg 
ctcactggca 
gcccgtgaca 
tgctcatgcc 
acctgctccc 
tggcgctgtg 
tgctccgccg 
gacggcgagg 
cccaaccagt 
ttccccgact 
gacgacagcc 
ttcgtcatgg 
gccaacgggc 
atagccccgg 
atgagctccg 
gtcacagggg 
ctgaacccgc 
tcttcaaaca 
cccccgacga 
aaggccagca 



ccctcgggtg 
acgccaagac 
tggaggacaa 
ctgactggca 
tcattgacca 
gaaccaaccc 
acgcaacccg 
tcgtgcctga 
agaccaataa 
actttgatgt 
gcgccttcat 
agggcatggc 
gaatcgaagt 
acaacccgag 
gcggcaagcc 
acaaggtggg 
ccgacctgga 
ttgccgacga 
cagactggaa 
tcatccaggg 
atggcctcaa 
ccggcggcca 
gcagcccgcc 
cggacgacca 
ccatcgacta 
agcgagccaa 
acccagacag 
cgtgcgaggc 
tgctgcgtgg 
tgtacttcac 
ccgagcgcga 
acacactggg 
tgtcaggggc 
ccatccttgg 
agaagaccac 
tccatgcagt 
atggtggctg 
cagtccacct 
cggaccagtt 
acggctttcc 
cccagttccc 
cagactgtca 
tccggtgtgc 
gtatcgacgg 
cggcccacag 
gtggtgtcta 
ccttcccgca 
gcggttccca 
tgagcctgat 
cctcgtccag 
cgccctcccc 
ttccggccac 
cgccctgcag 
agtactacct 



gcccaacggc 
agacaagatc 
gctcccgcac 
gcgccgcagc 
gctgcccgac 
gtgtgcggac 
gtgtggctgc 
ggccttcttg 
caacgacgtg 
gtccaacaac 
gaacgggagc 
cgttgactgg 
ggcgcggctg 
gtcgctggcc 
gaggatcgtg 
ccgggccaac 
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tcggactcag acccctatcc acccccaccc acgccccaca gccagtacct gtcggcggag 4860 

gacagctgcc cgccctcgcc cgccaccgag aggagctact tccatctctt cccgccccct 4920 

ccgtccccct gcacggactc atcctgacct cggccgggcc actctggctt ctctgtgccc 4980 

ctgtaaatag ttttaaatat gaacaaagaa aaaaatatat tttatgattt aaaaaataaa 5040 

tataattggg attttaaaaa catgagaaat gtgaactgtg atggggtggg cagggctggg 5100 

agaactttgt acagtggaga aatatttata aacttaattt tgtaaaacag aaaaaaaaaa 5160 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5220 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gcggccgc 5278 

<210> 760 

<211> 5278 

<212> DNA 

<213> Homo sapiens 

<400> 760 

tctagactcg aggcggccgc ccattgtgca ctaaagcgcc gccgccgcgc catggagccc 60 

gagtgagcgc ggcgcgggcc cgtccggccg ccggacaaca tggaggcagc gccgcccggg 12 0 

ccgccgtggc cgctgctgct gctgctgctg ctgctgctgg cgctgtgcgg ctgcccggcc 18 0 

cccgccgcgg cctcgccgct cctgctattt gccaaccgcc gggacgtacg gctggtggac 24 0 

gccggcggag tcaagctgga gtccaccatc gtggtcagcg gcctggagga tgcggccgca 3 00 

gtggacttcc agttttccaa gggagccgtg tactggacag acgtgagcga ggaggccatc 360 

aagcagacct acctgaacca gacgggggcc gccgtgcaga acgtggtcat ctccggcctg 42 0 

gtctctcccg acggcctcgc ctgcgactgg gtgggcaaga agctgtactg gacggactca 4 80 

gagaccaacc gcatcgaggt ggccaacctc aatggcacat cccggaaggt gctcttctgg 54 0 

caggaccttg accagccgag ggccatcgcc ttggaccccg ctcacgggta catgtactgg 600 

acagactggg gtgagacgcc ccggattgag cgggcaggga tggatggcag cacccggaag 660 

atcattgtgg actcggacat ttactggccc aatggactga ccatcgacct ggaggagcag 720 

aagctctact gggctgacgc caagctcagc ttcatccacc gtgccaacct ggacggctcg 780 

ttccggcaga aggtggtgga gggcagcctg acgcacccct tcgccctgac gctctccggg 840 

gacactctgt actggacaga ctggcagacc cgctccatcc atgcctgcaa caagcgcact 900 

ggggggaaga ggaaggagat cctgagtgcc ctctactcac ccatggacat ccaggtgctg 960 

agccaggagc ggcagccttt cttccacact cgctgtgagg aggacaatgg cggctgctcc 102 0 

cacctgtgcc tgctgtcccc aagcgagcct ttctacacat gcgcctgccc cacgggtgtg 1080 

cagctgcagg acaacggcag gacgtgtaag gcaggagccg aggaggtgct gctgctggcc 114 0 

cggcggacgg acctacggag gatctcgctg gacacgccgg acttcaccga catcgtgctg 12 00 

caggtggacg acatccggca cgccattgcc atcgactacg acccgctaga gggctatgtc 1260 

tactggacag atgacgaggt gcgggccatc cgcagggcgt acctggacgg gtctggggcg 1320 

cagacgctgg tcaacaccga gatcaacgac cccgatggca tcgcggtcga ctgggtggcc 13 80 

cgaaacctct actggaccga cacgggcacg gaccgcatcg aggtgacgcg cctcaacggc 144 0 

acctcccgca agatcctggt gtcggaggac ctggacgagc cccgagccat cgcactgcac 15 0 0 

cccgtgatgg gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 1560 

aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 1620 

ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 168 0 

gaggtgatca atgttgatgg gacgaagagg cggaccctcc tggaggacaa gctcccgcac 174 0 

attttcgggt tcacgctgct gggggacttc atctactgga ctgactggca gcgccgcagc 1800 

atcgagcggg tgcacaaggt caaggccagc cgggacgtca tcattgacca gctgcccgac 18 60 

ctgatggggc tcaaagctgt gaatgtggcc aaggtcgtcg gaaccaaccc gtgtgcggac 192 0 

aggaacgggg ggtgcagcca cctgtgcttc ttcacacccc acgcaacccg gtgtggctgc 198 0 

cccatcggcc tggagctgct gagtgacatg aagacctgca tcgtgcctga ggccttcttg 2 04 0 

gtcttcacca gcagagccgc catccacagg atctccctcg agaccaataa caacgacgtg 2100 

gccatcccgc tcacgggcgt caaggaggcc tcagccctgg actttgatgt gtccaacaac 2160 

cacatctact ggacagacgt cagcctgaag accatcagcc gcgccttcat gaacgggagc 222 0 

tcggtggagc acgtggtgga gtttggcctt gactaccccg agggcatggc cgttgactgg 22 80 

atgggcaaga acctctactg ggccgacact gggaccaaca gaatcgaagt ggcgcggctg 234 0 
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gacgggcagt tccggcaagt cctcgtgtgg agggacttgg acaacccgag gtcgctggcc 2400 

ctggatccca ccaagggcta catctactgg accgagtggg gcggcaagcc gaggatcgtg 2460 

cgggccttca tggacgggac caactgcatg acgctggtgg acaaggtggg ccgggccaac 2520 

gacctcacca ttgactacgc tgaccagcgc ctctactgga ccgacctgga caccaacatg 2580 

atcgagtcgt ccaacatgct gggtcaggag cgggtcgtga ttgccgacga tctcccgcac 2640 

ccgttcggtc tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc 2700 

attgagcggg ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac 2760 

ttcgtgatgg acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg 2820 

cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc 28 80 

tgcgcctcac actacaccct ggaccccagc agccgcaact gcagcccgcc caccaccttc 294 0 

ttgctgttca gccagaaatc tgccatcagt cggatgatcc cggacgacca gcacagcccg 3 000 

gatctcatcc tgcccctgca tggactgagg aacgtcaaag ccatcgacta tgacccactg 3 060 

gacaagttca tctactgggt ggatgggcgc cagaacatca agcgagccaa ggacgacggg 3120 

acccagccct ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac 3180 

gacctcagca tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc 3240 

atcaacgtcc acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac 33 00 

aagcccaggg ccatcgtcgt caacgcggag cgagggtacc tgtacttcac caacatgcag 33 60 

gaccgggcag ccaagatcga acgcgcagcc ctggacggca ccgagcgcga ggtcctcttc 342 0 

accaccggcc tcatccgccc tgtggccctg gtggtagaca acacactggg caagctgttc 3480 

tgggtggsicg cggacctgaa gcgcattgag agctgtgacc tgtcaggggc caaccgcctg 3 54 0 

accctggagg acgccaacat cgtgcagcct ctgggcctga ccatccttgg caagcatctc 3 600 

tactggatcg accgccagca gcagatgatc gagcgtgtgg agaagaccac cggggacaag 3 660 

cggactcgca tccagggccg tgtcgcccac ctcactggca tccatgcagt ggaggaagtc 3720 

agcctggagg agttctcagc ccacccatgt gcccgtgaca atggtggctg ctcccacatc 3780 

tgtattgcca agggtgatgg gacaccacgg tgctcatgcc cagtccacct cgtgctcctg 3 840 

cagaacctgc tgacctgtgg agagccgccc acctgctccc cggaccagtt tgcatgtgcc 3 900 

acaggggaga tcgactgtat ccccggggcc tggcgctgtg acggctttcc cgagtgcgat 3 960 

gaccagagcg acgaggaggg ctgccccgtg tgctccgccg cccagttccc ctgcgcgcgg 4020 

ggtcagtgtg tggacctgcg cctgcgctgc gacggcgagg cagactgtca ggaccgctca 4080 

gacgaggcgg actgtgacgc catctgcctg cccaaccagt tccggtgtgc gagcggccag 4140 

tgtgtcctca tcaaacagca gtgcgactcc ttccccgact gtatcgacgg ctccgacgag 42 00 

ctcatgtgtg aaatcaccaa gccgccctca gacgacagcc cggcccacag cagtgccatc 42 60 

gggcccgtca ttggcatcat cctctctctc ttcgtcatgg gtggtgtcta ttttgtgtgc 4320 

cagcgcgtgg tgtgccagcg ctatgcgggg gccaacgggc ccttcccgca cgagtatgtc 43 80 

agcgggaccc cgcacgtgcc cctcaatttc atagccccgg gcggttccca gcatggcccc 4440 

ttcacaggca tcgcatgcgg aaagtccatg atgagctccg tgagcctgat ggggggccgg 45 00 

ggcggggtgc ccctctacga ccggaaccac gtcacagggg cctcgtccag cagctcgtcc 4560 

agcacgaagg ccacgctgta cccgccgatc ctgaacccgc cgccctcccc ggccacggac 4620 

ccctccctgt acaacatgga catgttctac tcttcaaaca ttccggccac tgcgagaccg 4680 

tacaggccct acatcattcg aggaatggcg cccccgacga cgccctgcag caccgacgtg 4740 

tgtgacagcg actacagcgc cagccgctgg aaggccagca agtactacct ggatttgaac 4800 

tcggactcag acccctatcc acccccaccc acgccccaca gccagtacct gtcggcggag 4860 

gacagctgcc cgccctcgcc cgccaccgag aggagctact tccatctctt cccgccccct 4920 

ccgtccccct gcacggactc atcctgacct cggccgggcc actctggctt ctctgtgccc 4980 

ctgtaaatag ttttaaatat gaacaaagaa aaaaatatat tttatgattt aaaaaataaa 5040 

tataattggg attttaaaaa catgagaaat gtgaactgtg atggggtggg cagggctggg 5100 

agaactttgt acagtggaga aatatttata aacttaattt tgtaaaacag aaaaaaaaaa 5160 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5220 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gcggccgc 5278 

<210> 761 
<211> 1615 
<212> PRT 

<213> Homo sapiens 
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Gin 


Gly His 
885 


Leu 


Asp 


Phe 


Val 


Met 
890 


Asp 


He 


Leu 


Val 


Phe 
895 


His 


Ser 


Ser 


Arg 


Gin Asp 
900 


Gly 


Leu 


Asn 


Asp 
905 


Cys 


Met 


His 


Asn 


Asn 
910 


Gly 


Gin 


Cys 


Gly 


Gin 
915 


Leu Cys 


Leu 


Ala 


He 
920 


Pro 


Gly 


Gly 


His 


Arg 
925 


Cys 


Gly 


Cys 


Ala 


Ser 
930 


His 


Tyr Thr 


Leu 


Asp 
935 


Pro 


Ser 


Ser 


Arg 


Asn 
940 


Cys 


Ser 


Pro 


Pro 


Thr 


Thr 


Phe 


Leu Leu 


Phe 


Ser 


Gin 


Lys 


Ser 


Ala 


He 


Ser 


Arg 


Met 


He 


945 








950 










955 










960 


Pro 


Asp 


Asp 


Gin His 
965 


Ser 


Pro 


Asp 


Leu 


He 
970 


Leu 


Pro 


Leu 


His 


Gly 
975 


Leu 


Arg 


Asn 


Val 


Lys Ala 
980 


He 


Asp 


Tyr 


Asp 
985 


Pro 


Leu 


Asp 


Lys 


Phe 
990 


He 


Tyr 


Trp 


Val 


Asp 


Gly Arg 


Gin 


Asn 


He 


Lys 


Arg 


Ala 


Lys 


Asp 


Asp 


Gly 


Thr 






995 








1000 








1005 






Gin 


Pro 


Phe 


Val Leu 


Thr 


Ser 


Leu 


Ser 


Gin 


Gly 


Gin 


Asn 


Pro 


Asp 


Arg 




1010 






1015 








1020 








Gin 


Pro 


His 


Asp Leu 


Ser 


He 


Asp 


He 


Tyr 


Ser 


Arg 


Thr 


Leu 


Phe 


Trp 


1025 






1030 








1035 








1040 


Thr 


Cys 


Glu 


Ala Thr 


Asn 


Thr 


He 


Asn 


Val 


His 


Arg 


Leu 


Ser 


Gly 


Glu 








1045 








1050 








1055 


Ala 


Met 


Gly 


Val Val 


Leu Arg Gly Asp 


Arg 


Asp 


Lys 


Pro 


Arg 


Ala 


He 








1060 








1065 








1070 




Val 


Val 


Asn 


Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met 


Gin 


Asp 






1075 






1080 








1085 






Arg 


Ala 


Ala 


Lys He 


Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg 


Glu 




1090 






1095 








1100 








Val 


Leu 


Phe 


Thr Thr 


Gly 


Leu 


He 


Arg 


Pro 


Val 


Ala 


Leu 


Val 


Val 


Asp 


1105 






1110 








1115 








1120 


Asn 


Thr 


Leu 


Gly Lys 


Leu 


Phe 


Trp Val Asp Ala Asp 


Leu 


Lys 


Arg 


He 








1125 








1130 








1135 


Glu 


Ser 


Cys 


Asp Leu 


Ser 


Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 








1140 








1145 








1150 




Asn 


He 


Val 


Gin Pro Leu Gly Leu Thr He Leu Gly Lys 


His 


Leu 


Tyr 






1155 






1160 








1165 






Trp 


He 


Asp Arg Gin 


Gin 


Gin 


Met 


He Glu Arg Val 


Glu 


Lys 


Thr 


Thr 




1170 






1175 








1180 








Gly Asp Lys Arg Thr Arg 


He 


Gin Gly Arg 


Val 


Ala 


His 


Leu 


Thr Gly 


1185 






1190 








1195 








1200 


He 


His 


Ala 


Val Glu 


Glu 


Val 


Ser 


Leu 


Glu 


Glu 


Phe 


Ser 


Ala 


His 


Pro 








1205 








1210 








1215 


Cys 


Ala Arg Asp Asn Gly Gly Cys 


Ser 


His 


He 


Cys 


He 


Ala 


Lys 


Gly 








1220 








1225 








1230 




Asp 


Gly Thr 


Pro Arg 


Cys 


Ser 


Cys 


Pro 


Val 


His 


Leu 


Val 


Leu 


Leu 


Gin 






1235 






1240 








1245 






Asn 


Leu 


Leu 


Thr Cys 


Gly Glu 


Pro 


Pro 


Thr 


Cys 


Ser 


Pro 


Asp 


Gin 


Phe 




1250 






1255 








1260 








Ala 


Cys 


Ala 


Thr Gly Glu 


He 


Asp 


Cys 


He 


Pro Gly Ala 


Trp 


Arg 


Cys 


1265 






1270 








1275 








1280 


Asp 


Gly Phe 


Pro Glu 


Cys 


Asp 


Asp 


Gin 


Ser 


Asp 


Glu 


Glu 


Gly 


Cys 


Pro 
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1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Ala Asp Cys Asp Ala lie Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 " 1335 1340 

Ser Gly Gin Cys Val Leu lie Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys lie Asp Gly Ser Asp Glu Leu Met Cys Glu lie Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala lie Gly Pro Val lie Gly 

1380 1385 1390 

lie lie Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe lie Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly lie Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 762 
<211> 1564 
<212> PRT 
<213> Mouse 

<400> 762 

Met Glu Thr Ala Pro Thr Arg Ala Pro Pro Pro Pro Pro Pro Pro Leu 

15 10 15 

Leu Leu Leu .Val Leu Tyr Cys Ser Leu Val Pro Ala Ala Ala Ser Pro 

20 25 30 

Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala Gly 
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35 










40 




Gly 


Val 


Lys 


Leu 


Glu 


Ser 


Thr 


He 


Val Ala 




50 










55 






Ala 


Ala 


Val 


Asp 


Phe 


Gin 


Phe 


Ser 


Lys Gly 


65 










70 








Val 


Ser 


Glu 


Glu 


Ala 


He 


Lys 


Gin 


Thr Tyr 










85 








90 


Ala 


Ala 


Gin 


Asn 


He 


Val 


He 


Ser Gly Leu 








100 










105 


Ala 


Cys 


Asp 


Trp 


Val 


Gly 


Lys 


Lys 


Leu Tyr 






115 










120 




Asn 


Arg 


He 


Glu 


Val 


Ala 


Asn 


Leu Asn Gly 




130 










135 






Phe 


Trp 


Gin 


Asp 


Leu Asp 


Gin 


Pro Arg Ala 


145 










150 








His 


Gly 


Tyr 


Met 


Tyr 


Trp 


Thr 


Asp 


Trp Gly 










165 








170 


Arg 


Ala 


Gly 


Met 


Asp 


Gly Ser Thr Arg Lys 








180 










185 


He 


Tyr 


Trp 


Pro 


Asn 


Gly 


Leu 


Thr 


He Asp 






195 










200 




Tyr 


Trp 


Ala 


Asp 


Ala 


Lys 


Leu 


Ser 


Phe He 




210 










215 






Gly 


Ser 


Phe 


Arg 


Gin 


Lys 


Val 


Val 


Glu Gly 


225 










230 








Ala 


Leu 


Thr 


Leu 


Ser 


Gly Asp 


Thr 


Leu Tyr 










245 








250 


Arg 


Ser 


He 


His 


Ala 


Cys 


Asn 


Lys 


Trp Thr 








260 










265 


He 


Leu 


Ser 


Ala 


Leu 


Tyr 


Ser 


Pro 


Met Asp 






275 










280 




Glu 


Arg 


Gin 


Pro 


Pro 


Phe 


His 


Thr 


Pro Cys 




290 










295 






Cys 


Ser 


His 


Leu 


Cys 


Leu 


Leu 


Ser 


Pro Arg 


305 










310 








Ala 


Cys 


Pro 


Thr Gly Val 


Gin 


Leu 


Gin Asp 










325 








330 


Thr 


Gly 


Ala 


Glu 


Glu 


Val 


Leu 


Leu 


Leu Ala 








340 










345 


Arg 


He 


Ser 


Leu Asp 


Thr 


Pro 


Asp 


Phe Thr 






355 










360 




Gly 


Asp 


He 


Arg 


His 


Ala 


He 


Ala 


He Asp 




370 










375 






Tyr 


Val 


Tyr 


Trp 


Thr 


Asp 


Asp 


Glu 


Val Arg 


385 










390 








Leu 


Asp 


Gly 


Ser 


Gly Ala 


Gin 


Thr 


Leu Val 










405 








410 


Pro 


Asp 


Gly 


He 


Ala 


Val 


Asp 


Trp 


Val Ala 








420 










425 


Asp 


Thr 


Gly 


Thr Asp Arg He Glu Val Thr 






435 










440 




Arg 


Lys 


He 


Leu 


Val 


Ser 


Glu 


Asp 


Leu Asp 




450 










455 






Leu 


His 


Pro 


Val 


Met 


Gly 


Leu 


Met 


Tyr Trp 
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45 



Ser 


Gly Leu 


Glu 


Asp 


Ala 




60 










Ala 


Val 


Tyr 


Trp 


Thr 


Asp 


75 










80 


Leu 


Asn 


Gin 


Thr Gly 


Ala 










95 




Val 


Ser 


Pro 


Asp Gly 


Leu 








110 






Trp 


Thr 


Asp 


Ser 


Glu 


Thr 






125 








Thr 


Ser 


Arg 


Lys 


Val 


Leu 




140 










He 


Ala 


Leu 


Asp 


Pro 


Ala 


155 










160 


Glu 


Ala 


Pro 


Arg 


He 


Glu 










175 




He 


He 


Val 


Asp 


Ser 


Asp 








190 






Leu 


Glu 


Glu 


Gin 


Lys 


Leu 






205 








His 


Arg 


Ala 


Asn 


Leu 


Asp 




220 










Ser 


Leu 


Thr 


His 


Pro 


Phe 


235 










240 


Trp Thr Asp 


Trp 


Gin 


Thr 










255 




Gly Glu Gin 


Arg 


Lys 


Glu 








270 






He 


Gin 


Val 


Leu 


Ser 


Gin 






285 








Glu Glu Asp 


Asn Gly 


Gly 




300 










Glu 


Pro 


Phe 


Tyr 


Ser 


Cys 


315 










320 


Asn Gly Lys 


Thr 


Cys 


Lys 










335 




Arg Arg 


Thr 


Asp 


Leu 


Arg 








350 






Asp 


He 


Val 


Leu 


Gin 


Val 






365 








Tyr 


Asp 


Pro 


Leu 


Glu 


Gly 




380 










Ala 


He 


Arg 


Arg 


Ala 


Tyr 


395 










400 


Asn 


Thr 


Glu 


He 


Asn 


Asp 










415 




Arg Asn 


Leu 


Tyr 


Trp 


Thr 








430 






Arg 


Leu 


Asn 


Gly Thr 


Ser 






445 








Glu 


Pro 


Arg 


Ala 


He 


Val 




460 










Thr 


Asp 


Trp 


Gly Glu 


Asn 
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465 








470 










475 










480 


Pro 


Lys 


He Glu 


Cys 


Ala 


Asn 


Leu 


Asp 


Gly 


Arg 


Asp 


Arg 


His 


Val 


Leu 








485 










490 










495 




Val 


Asn 


Thr Ser Leu Gly 


Trp 


Pro 


Asn 


Gly 


Leu 


Ala 


Leu 


Asp 


Leu 


Gin 






500 










505 










510 






Glu 


Gly 


Lys Leu 


Tyr 


Trp 


Gly Asp 


Ala 


Lys 


Thr 


Asp 


Lys 


He 


Glu 


Val 






515 








520 










525 








lie 


Asn 


He Asp Gly Thr 


Lys 


Arg 


Lys 


Thr 


Leu 


Leu 


Glu 


Asp 


Lys 


Leu 




530 








535 










540 










Pro 


His 


He Phe 


Gly 


Phe 


Thr 


Leu 


Leu 


Gly 


Asp 


Phe 


He 


Tyr 


Trp 


Thr 


545 








550 










555 










560 


Asp 


Trp 


Gin Arg Arg 


Ser 


He 


Glu 


Arg 


Val 


His 


Lys 


Val 


Lys 


Ala 


Ser 








565 










570 










575 




Arg 


Asp 


Val He 


He 


Asp 


Gin 


Leu 


Pro 


Asp 


Leu 


Met 


Gly 


Leu 


Lys 


Ala 






580 










585 










590 






Val 


Asn 


Val Ala 


Lys 


Val 


Val 


Gly 


Thr 


Asn 


Pro 


Cys 


Ala 


Asp 


Gly Asn 






595 








600 










605 








Gly 


Gly 


Cys Ser 


His 


Leu 


Cys 


Phe 


Phe 


Thr 


Pro 


Arg 


Ala 


Thr 


Lys 


Cys 




610 








615 










620 










Gly 


Cys 


Pro He 


Gly Leu 


Glu 


Leu 


Leu 


Ser 


Asp 


Met 


Lys 


Thr 


Cys 


He 


625 








630 










635 










640 


lie 


Pro 


Glu Ala 


Phe 


Leu 


Val 


Phe 


Thr 


Ser 


Arg 


Ala 


Thr 


He 


His 


Arg 








645 










650 










655 




lie 


Ser 


Leu Glu 


Thr 


Asn 


Asn 


Asn Asp 


Val 


Ala 


He 


Pro 


Leu 


Thr 


Gly 






660 










665 










670 






Val 


Lys 


Glu Ala 


Ser 


Ala 


Leu 


Asp 


Phe 


Asp 


Val 


Ser 


Asn 


Asn 


His 


He 






675 








680 










685 








Tyr 


Trp 


Thr Asp 


Val 


Ser 


Leu 


Lys 


Thr 


He 


Ser 


Arg 


Ala 


Phe 


Met 


Asn 




690 








695 










700 










Gly 


Ser 


Ser Val 


Glu 


His 


Val 


He 


Glu 


Phe 


Gly 


Leu 


Asp 


Tyr 


Pro 


Glu 


705 








710 










715 










720 


Gly 


Met 


Ala Val 


Asp 


Trp 


Met 


Gly 


Lys 


Asn 


Leu 


Tyr 


Trp 


Ala 


Asp 


Thr 








725 










730 










735 




Gly 


Thr 


Asn Arg 


He 


Glu 


Val 


Ala 


Arg 


Leu 


Asp 


Gly 


Gin 


Phe 


Arg 


Gin 






740 










745 










750 






Val 


Leu 


Val Trp 


Arg 


Asp 


Leu 


Asp 


Asn 


Pro 


Arg 


Ser 


Leu 


Ala 


Leu 


Asp 






755 








760 










765 








Pro 


Thr 


Lys Gly 


Tyr 


He 


Tyr 


Trp 


Thr 


Glu 


Trp 


Gly 


Gly 


Lys 


Pro 


Arg 




770 








775 










780 










lie 


Val 


Arg Ala 


Phe 


Met 


Asp Gly Thr Asn 


Cys 


Met 


Thr 


Leu 


Val 


Asp 


785 








790 










795 










800 


Lys 


Val 


Gly Arg Ala Asn 


Asp 


Leu 


Thr 


He 


Asp 


Tyr 


Ala 


Asp 


Gin 


Arg 








805 










810 










815 




Leu 


Tyr 


Trp Thr 


Asp 


Leu 


As P 


Thr 


Asn 


Met 


He 


Glu 


Ser 


Ser 


Asn 


Met 






820 










825 










830 






Leu 


Gly 


Gin Glu 


Arg 


Met 


Val 


He 


Ala 


Asp 


Asp 


Leu 


Pro 


Tyr 


Pro 


Phe 






835 








840 










845 








Gly 


Leu 


Thr Gin 


Tyr 


Ser 


Asp 


Tyr 


He 


Tyr 


Trp 


Thr 


Asp 


Trp 


Asn 


Leu 




850 








855 










860 










His 


Ser 


He Glu 


Arg 


Ala 


Asp 


Lys 


Thr 


Ser 


Gly 


Arg 


Asn 


Arg 


Thr 


Leu 


865 








870 










875 










880 


He 


Gin 


Gly His 


Leu Asp 


Phe 


Val 


Met 


Asp 


He 


Leu 


Val 


Phe 


His 


Ser 








885 










890 










895 




Ser 


Arg 


Gin Asp 


Gly Leu 


Asn 


Asp 


Cys 


Val 


His 


Ser 


Asn 


Gly 


Gin 


Cys 
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900 



905 



910 



Gly 


Gin 


Leu 
915 


Cys 


Leu 


Ala 


He 


Pro 
920 


Gly 


Gly 


His 


Arg 


Cys 
925 


Gly 


Cys 


Ala 


Ser 


His 


Tyr 


Thr 


Leu Asp 


Pro 


Ser 


Ser 


Arg 


Asn 


Cys 


Ser 


Pro 


Pro 


Ser 




930 










935 










94 0 










Thr 


Phe 


Leu 


Leu 


Phe 


Ser 


Gin 


Lys 


Phe 


Ala 


He 


Ser 


Arg 


Met 


He 


Pro 


945 










950 










955 










960 


Asp 


Asp 


Gin 


Leu 


Ser 
965 


Pro 


Asp 


Leu 


Val 


Leu 
970 


Pro 


Leu 


His 


Gly 


Leu 
975 


Arg 


Asn 


Val 


Lys 


Ala 
980 


lie 


Asn 


Tyr 


Asp 


Pro 
985 


Leu 


Asp 


Lys 


Phe 


He 
990 


Tyr 


Trp 


Val 


Asp 


Gly 


Arg 


Gin 


Asn 


He 


Lys 


Arg 


Ala 


Lys 


Asp 


Asp 


Gly 


Thr 


Gin 






995 










1000 








1005 






Pro 


Ser 


Met 


Leu 


Thr 


Ser 


Pro 


Ser 


Gin 


Ser 


Leu 


Ser 


Pro Asp 


Arg 


Gin 




1010 








1015 








1020 








Pro 


His 


Asp 


Leu 


Ser 


He 


Asp 


He 


Tyr 


Ser 


Arg 


Thr 


Leu 


Phe 


Trp 


Thr 


1025 








1030 








1035 








1041 


Cys 


Glu 


Ala 


Thr 


Asn 


Thr 


He 


Asn 


Val 


His 


Arg 


Leu 


Asp 


Gly 


Asp 


Ala 










1045 








1050 








1055 


Met 


Gly Val 


Val 


Leu Arg Gly Asp 


Arg 


Asp 


Lys 


Pro 


Arg 


Ala 


He 


Ala 








1060 








1065 








1070 




Val 


Asn 


Ala 


Glu 


Arg 


Gly 


Tyr 


Met 


Tyr 


Phe 


Thr 


Asn 


Met 


Gin Asp 


His 



1075 1080 1085 



Ala Ala Lys He Glu Arg Ala Ser Leu Asp Gly Thr Glu Arg Glu Val 

1090 1095 ^ 1100 

Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp Asn 
1105 1110 1115 1121 

Ala Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He Glu 

1125 1130 1135 

Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala Asn 

1140 1145 1150 

He Val Gin Pro Val Gly Leu Thr Val Leu Gly Arg His Leu Tyr Trp 

1155 1160 1165 

He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr Gly 

1170 ~ 1175 1180 

Asp Lys Arg Thr Arg Val Gin Gly Arg Val Thr His Leu Thr Gly He 
1185 1190 1195 120< 

His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro Cys 

1205 1210 1215 

Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly Asp 

1220 1225 1230 

Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin Asn 

1235 1240 1245 

Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe Ala 

1250 1255 1260 

Cys Thr Thr Gly Glu He Asp Cys He Pro Gly Ala Trp Arg Cys Asp 
1265 1270 1275 1281 

Gly Phe Pro Glu Cys Ala Asp Gin Ser Asp Glu Glu Gly Cys Pro Val 

1285 1290 1295 

Cys Ser Ala Ser Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp Leu 

1300 1305 1310 

Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp Glu 

1315 1320 1325 

Ala Asn Cys Asp Ala Val Cys Leu Pro Asn Gin Phe Arg Cys Thr Ser 
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1330 1335 1340 

Gly Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin Arg Val Met 
1345 1350 1355 1360 

Cys Gin Arg Tyr Thr Gly Ala Ser Gly Pro Phe Pro His Glu Tyr Val 

1365 1370 1375 

Gly Gly Ala Pro His Val Pro Leu Asn Phe lie Ala Pro Gly Gly Ser 

1380 1385 1390 

Gin His Gly Pro Phe Pro Gly lie Pro Cys Ser Lys Ser Val Met Ser 

1395 1400 1405 

Ser Met Ser Leu Val Gly Gly Arg Gly Ser Val Pro Leu Tyr Asp Arg 

1410 1415 1420 

Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser Thr Lys Ala 
1425 1430 1435 1440 

Thr Leu Tyr Pro Pro lie Leu Asn Pro Pro Pro Ser Pro Ala Thr Asp 

1445 1450 1455 

Pro Ser Leu Tyr Asn Val Asp Val Phe Tyr Ser Ser Gly lie Pro Ala 

1460 1465 1470 

Thr Ala Arg Pro Tyr Arg Pro Tyr Val lie Arg Gly Met Ala Pro Pro 

1475 1480 1485 

Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr Ser lie Ser 

1490 1495 1500 

Arg Trp Lys Ser Ser Lys Tyr Tyr Leu Asp Leu Asn Ser Asp Ser Asp 
1505 1510 1515 1520 

Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu Ser Ala Glu 

1525 1530 1535 

Asp Ser Cys Pro Pro Ser Pro Gly Thr Glu Arg Ser Tyr Cys His Leu 

1540 1545 1550 

Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1555 1560 



<210> 763 
<211> 1665 
<212> PRT 

<213> Homo sapiens 



<400> 763 
Met Glu Ala Ala 
1 

Leu Leu Leu Leu 
20 

Pro Leu Leu Leu 
35 

Gly Gly Val Lys 
50 

Ala Ala Ala Val 
65 

Asp Val Ser Glu 

Ala Ala Val Gin 
100 

Leu Ala Cys Asp 
115 

Thr Asn Arg lie 



Pro Pro Gly Pro 
5 

Ala Leu Cys Gly 

Phe Ala Asn Arg 
40 

Leu Glu Ser Thr 
55 

Asp Phe Gin Phe 
70 

Glu Ala lie Lys 
85 

Asn Val Val He 

Trp Val Gly Lys 
120 

Glu Val Ala Asn 



Pro Trp Pro Leu 
10 

Cys Pro Ala Pro 
25 

Arg Asp Val Arg 

He Val Val Ser 
60 

Ser Lys Gly Ala 
75 

Gin Thr Tyr Leu 
90 

Ser Gly Leu Val 
105 

Lys Leu Tyr Trp 
Leu Asn Gly Thr 



Leu Leu Leu Leu 
15 

Ala Ala Ala Ser 
30 

Leu Val Asp Ala 
45 

Gly Leu Glu Asp 

Val Tyr Trp Thr 
80 

Asn Gin Thr Gly 
95 

Ser Pro Asp Gly 
110 

Thr Asp Ser Glu 
125 

Ser Arg Lys Val 
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130 










135 




Leu 


Phe 


Trp 


Gin Asp 


Leu 


Asp 


Gin 


145 










150 






Ala 


His 


Gly 


Tyr 


Met 


Tyr 


Trp 


Thr 










165 








Glu 


Arg 


Ala 


Gly Met 


Asp 


Gly 


Ser 








180 










Asp 


He 


Tyr 


Trp 


Pro 


Asn 


Gly 


Leu 






195 










200 


lieu 


Tyr 


Trp 


Ala Asp Ala 


Lys 


Leu 




210 










215 




Asp 


Gly 


Ser 


Phe 


Arg 


Gin 


Lys 


Val 


225 










230 






Phe 


Ala 


Leu 


Thr 


Leu 


Ser 


Gly Asp 










245 








Thr 


Arg 


Ser 


He 


His 


Ala 


Cys 


Asn 








260 










Glu 


He 


Leu 


Ser 


Ala 


Leu 


Tyr 


Ser 






275 










280 


Gin 


Glu 


Arg 


Gin 


Pro 


Phe 


Phe 


His 




290 










295 




Gly 


Cys 


Ser 


His 


Leu 


Cys 


Leu 


Leu 


305 










310 






Cys 


Ala 


Cys 


Pro 


Thr 


Gly Val 


Gin 










325 








Lys 


Ala 


Gly 


Ala 


Glu 


Glu 


Val 


Leu 








340 










Arg 


Arg 


He 


Ser Leu Asp 


Thr 


Pro 






355 










360 


Val 


Asp 


Asp 


He 


Arg 


His 


Ala 


He 




370 










375 




Gly 


Tyr 


Val 


Tyr 


Trp 


Thr 


Asp 


Asp 


385 










390 






Tyr 


Leu 


Asp 


Gly Ser 


Gly Ala 


Gin 










405 








Asp 


Pro 


Asp 


Gly 


He 


Ala 


Val 


Asp 








42 0 










Thr 


Asp 


Thr 


Gly Thr Asp Arg 


He 






435 










440 


Ser 


Arg 


Lys 


He 


Leu 


Val 


Ser 


Glu 




450 










455 




Ala 


Leu 


His 


Pro 


Val 


Met 


Gly Leu 


465 










470 






Asn 


Pro 


Lys 


He 


Glu 


Cys 


Ala 


Asn 










485 








Leu 


Val 


Asn 


Ala 


Ser Leu Gly Trp 








500 










Gin 


Glu 


Gly 


Lys 


Leu 


Tyr 


Trp 


Gly 






515 










520 


Val 


He 


Asn 


Val 


Asp 


Gly Thr 


Lys 




530 










535 




Leu 


Pro 


His 


He 


Phe 


Gly 


Phe 


Thr 


545 










550 






Thr 


Asp 


Trp 


Gin Arg Arg Ser 


He 



140 



Pro Arg 


Ala 


He 


Ala 


Leu 


Asp 


Pro 






155 










160 


Asp 


Trp 


Gly Glu 


Thr 


Pro 


Arg 


He 




170 










175 




Thr Arg 


L Y S 


He 


He 


Val 


Asp 


Ser 


185 










190 






Thr 


He 


Asp 


Leu 


Glu 


Glu 


Gin 


Lys 










205 








Ser 


Phe 


He 


His 


Arg 


Ala 


Asn 


Leu 








220 










Val 


Glu 


Gly 


Ser 


Leu 


Thr 


His 


Pro 






235 










240 


Thr 


Leu 


Tyr 


Trp 


Thr 


Asp 


Trp 


Gin 




250 










255 




Lys 


Arg 


Thr 


Gly 


Gly 


Lys 


Arg 


Lys 


265 










270 






Pro 


Met 


Asp 


He 


Gin 


Val 


Leu 


Ser 










285 








Thr Arg 


Cys 


Glu 


Glu 


Asp 


Asn 


Gly 








300 










Ser 


Pro 


Ser 


Glu 


Pro 


Phe 


Tyr 


Thr 






315 










320 


Leu 


Gin 


Asp 


Asn 


Gly 


Arg 


Thr 


Cys 




330 










335 




Leu 


Leu 


Ala 


Arg 


Arg 


Thr 


Asp 


Leu 


345 










350 






Asp 


Phe 


Thr 


Asp 


He 


Val 


Leu 


Gin 










365 








Ala 


He 


Asp 


Tyr 


Asp 


Pro 


Leu 


Glu 








380 










Glu 


Val 


Arg 


Ala 


He 


Arg 


Arg 


Ala 






395 










400 


Thr 


Leu 


Val 


Asn 


Thr 


Glu 


He 


Asn 




410 










415 




Trp 


Val 


Ala 


Arg 


Asn 


Leu 


Tyr 


Trp 


425 










430 






Glu 


Val 


Thr 


Arg 


Leu 


Asn 


Gly 


Thr 










445 








Asp 


Leu 


Asp 


Glu 


Pro 


Arg 


Ala 


He 








460 










Met 


Tyr 


Trp 


Thr 


Asp 


Trp 


Gly 


Glu 






475 










480 


Leu Asp 


Gly Gin 


Glu 


Arg 


Arg 


Val 




490 










495 




Pro 


Asn 


Gly Leu 


Ala 


Leu 


Asp 


Leu 


505 










510 






Asp Ala 


Lys 


Thr 


Asp 


Lys 


He 


Glu 










525 








Arg Arg 


Thr 


Leu 


Leu 


Glu 


Asp 


Lys 








540 










Leu 


Leu 


Gly Asp 


Phe 


He 


Tyr 


Trp 






555 










560 


Glu 


Arg 


Val 


His 


Lys 


Val 


Lys 


Ala 
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565 








570 










575 




Ser 


Arg 


Asp 


Val 
580 


He 


He 


Asp 


Gin 


Leu Pro 
585 


Asp 


Leu 


Met 


Gly 
590 


Leu 


Lys 


Ala 


Val 


Asn 
595 


Val 


Ala 


Lys 


Val 


Val 
600 


Gly Thr 


Asn 


Pro 


Cys 
605 


Ala 


Asp 


Arg 


Asn 


Gly 
610 


Gly 


Cys 


Ser 


His 


Leu 
615 


Cys 


Phe Phe 


Thr 


Pro 
620 


His 


Ala 


Thr 


Arg 


Cys 


Gly 


Cys 


Pro 


He 


Gly 


Leu 


Glu 


Leu Leu 


Ser 


Asp 


Met 


Lys 


Thr 


Cys 


625 










630 








635 










640 


He 


Val 


Pro 


Glu 


Ala 
645 


Phe 


Leu 


Val 


Phe Thr 
650 


Ser 


Arg 


Ala 


Ala 


He 
655 


His 


Arg 


He 


Ser 


Leu 


Glu 


Thr 


Asn 


Asn Asn Asp 


Val 


Ala 


He 


Pro 


Leu 


Thr 








660 










665 








670 






Gly 


Val 


Lys 


Glu 


Ala 


Ser 


Ala 


Leu Asp Phe 


Asp 


Val 


Ser 


Asn 


Asn 


His 






675 










680 








685 








He 


Tyr 


Trp 


Thr Asp 


Val 


Ser 


Leu 


Lys Thr 


He 


Ser 


Arg 


Ala 


Phe 


Met 




690 










695 








700 










Asn 


Gly 


Ser 


Ser 


Val 


Glu 


His 


Val 


Val Glu 


Phe 


Gly 


Leu 


Asp 


Tyr 


Pro 


705 










710 








715 










720 


Glu 


Gly 


Met 


Ala 


Val 
725 


Asp 


Trp 


Met 


Gly Lys 
730 


Asn 


Leu 


Tyr 


Trp 


Ala 
735 


Asp 


Thr 


Gly 


Thr 


Asn Arg 


He 


Glu Val Ala Arg 


Leu 


Asp 


Gly 


Gin 


Phe 


Arg 








740 










745 








750 






Gin 


Val 


Leu 


Val 


Trp 


Arg 


Asp 


Leu Asp Asn 


Pro 


Arg 


Ser 


Leu 


Ala 


Leu 






755 










760 








765 








Asp 


Pro 
770 


Thr 


Lys 


Gly 


Tyr 


He 
775 


Tyr 


Trp Thr 


Glu 


Trp 
780 


Gly 


Gly 


Lys 


Pro 


Arg 


He 


Val 


Arg Ala 


Phe 


Met 


Asp 


Gly Thr 


Asn 


Cys 


Met 


Thr 


Leu 


Val 


785 










790 








795 










800 


Asp 


Lys 


Val 


Gly Arg 


Ala 


Asn 


Asp 


Leu Thr 


He 


Asp 


Tyr 


Ala 


Asp 


Gin 










805 








810 










815 




Arg 


Leu 


Tyr 


Trp 
820 


Thr 


Asp 


Leu 


Asp 


Thr Asn 
825 


Met 


He 


Glu 


Ser 
830 


Ser 


Asn 


Met 


Leu 


Gly 
835 


Gin 


Glu 


Arg 


Val 


Val 
840 


He Ala 


Asp 


Asp 


Leu 
845 


Pro 


His 


Pro 


Phe 


Gly 
850 


Leu 


Thr 


Gin 


Tyr 


Ser 
855 


Asp 


Tyr He 


Tyr 


Trp 
860 


Thr 


Asp 


Trp 


Asn 


Leu 


His 


Ser 


He 


Glu 


Arg 


Ala 


Asp 


Lys Thr 


Ser 


Gly 


Arg 


Asn 


Arg 


Thr 


865 










870 








875 










880 


Leu 


He 


Gin 


Gly His 


Leu 


Asp 


Phe 


Val Met 


Asp 


He 


Leu 


Val 


Phe 


His 










885 








890 










895 




Ser 


Ser 


Arg 


Gin 


Asp 


Gly 


Leu 


Asn Asp Cys 


Met 


His 


Asn 


Asn 


Gly 


Gin 








900 










905 








910 






c ys 


Gly 


Gin 
915 


Leu 


Cys 


Leu 


Ala 


He 
920 


Pro Gly 


Gly 


His 


Arg 
925 


Cys 


Gly 


Cys 


Ala 


Ser 
930 


His 


Tyr 


Thr 


Leu 


Asp 
935 


Pro 


Ser Ser 


Arg 


Asn 
940 


Cys 


Ser 


Pro 


Pro 


Thr 


Thr 


Phe 


Leu 


Leu 


Phe 


Ser 


Gin 


Lys Ser 


Ala 


He 


Ser 


Arg 


Met 


He 


945 










950 








955 










960 


Pro 


Asp 


Asp 


Gin 


His 
965 


Ser 


Pro 


Asp 


Leu He 
970 


Leu 


Pro 


Leu 


His 


Gly 
975 


Leu 


Arg 


Asn 


Val 


Lys 


Ala 


He 


Asp 


Tyr Asp Pro 


Leu 


Asp 


Lys 


Phe 


He 


Tyr 








980 










985 








990 






Trp 


Val 


Asp 


Gly Arg 


Gin 


Asn 


Phe 


Ser Gin 


Lys 


Ser 


Ala 


He 


Asn 


Arg 
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995 



1000 



1005 



Met Val lie Asp Glu Gin Gin Ser Pro Asp lie lie Leu Pro lie His 

1010 1015 1020 

Ser Leu Arg Asn Val Arg Ala He Asp Tyr Asp Pro Leu Asp Lys Gin 
1025 1030 1035 1041 

Leu Tyr Trp He Asp Ser Arg Gin Asn He Lys Arg Ala Lys Asp Asp 

1045 1050 1055 

Gly Thr Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro 

1060 1065 1070 

Asp Arg Gin Pro His Asp Leu Ser He Asp He Tyr Ser Arg Thr Leu 

1075 1080 1085 

Phe Trp Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser 

1090 1095 1100 

Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg 
1105 1110 1115 112( 

Ala He Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met 

1125 1130 1135 

Gin Asp Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu 

1140 1145 1150 

Arg Glu Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val 

1155 1160 1165 

Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys 

1170 1175 1180 

Arg He Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu 
1185 1190 1195 120( 

Asp Ala Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His 

1205 1210 1215 

Leu Tyr Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys 

1220 1225 1230 

Thr Thr Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu 

1235 1240 1245 

Thr Gly He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala 

1250 1255 1260 

His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala 
1265 1270 1275 128( 

Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu 

1285 1290 1295 

Leu Gin Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp 

1300 1305 1310 

Gin Phe Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala Trp 

1315 1320 1325 

Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly 

1330 1335 1340 

Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys 
1345 1350 1355 136( 

Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg 

1365 1370 1375 

Ser Asp Glu Ala Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg 

1380 1385 1390 

Cys Ala Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe 

1395 1400 1405 

Pro Asp Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys 

1410 1415 1420 

Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val 
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1425 143 0 1435 144 0 

lie Gly He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val 

1445 1450 1455 

Cys Gin Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe 

1460 1465 1470 

Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He 

1475 1480 1485 

Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly 

1490 1495 1500 

Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val 
1505 1510 1515 1520 

Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser 

1525 1530 1535 

Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro 

1540 1545 1550 

Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser 

1555 1560 1565 

Ser Asn He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He Arg 

1570 1575 1580 

Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser 
1585 1590 1595 1600 

Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu 

1605 1610 1615 

Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin 

1620 1625 1630 

Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg 

1635 1640 1645 

Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser 
1650 1655 1660 

Ser 
1665 



<210> 764 
<211> 1613 
<212> PRT 

<213> Homo sapiens 
<400> 764 

Met Gly Ala Val Leu Arg Ser Leu Leu Ala Cys Ser Phe Cys Val Leu 

15 10 15 

Leu Arg Ala Ala Pro Leu Leu Leu Tyr Ala Asn Arg Arg Asp Leu Arg 

20 25 30 

Leu Val Asp Ala Thr Asn Gly Lys Glu Asn Ala Thr He Val Val Gly 

35 40 45 

Gly Leu Glu Asp Ala Ala Ala Val Asp Phe Val Phe Ser His Gly Leu 

50 55 60 

He Tyr Trp Ser Asp Val Ser Glu Glu Ala He Lys Arg Thr Glu Phe 
65 70 75 80 

Asn Lys Thr Glu Ser Val Gin Asn Val Val Val Ser Gly Leu Leu Ser 

85 90 95 

Pro Asp Gly Leu Ala Cys Asp Trp Leu Gly Glu Lys Leu Tyr Trp Thr 

100 105 110 

Asp Ser Glu Thr Asn Arg He Glu Val Ser Asn Leu Asp Gly Ser Leu 
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115 








12 0 




Arg 


Lys 


Val 


Leu 


Phe 


Trp Gin 


Glu 


Leu 




130 








135 






Leu 


Asp 


Pro 


Ser 


Ser Gly Phe 


Met 


Tyr 


145 










150 






Pro 


Lys 


He 


Glu 


Arg Ala Gly 


Met 


Asp 










165 








lie 


Asn 


Ser 


Glu 


He 


Tyr Trp 


Pro 


Asn 








180 








185 


Glu 


Gin 


Lys 


Leu 


Tyr 


Trp Ala 


Asp 


Ala 






195 








200 




Ser 


Asn 


Leu 


Asp 


Gly Thr Asn 


Arg 


Gin 




210 








215 






Pro 


His 


Pro 


Phe 


Ala 


Leu Thr 


Leu 


Phe 


225 










230 






Asp 


Trp 


Ser 


Thr 


His 


Ser He 


Leu 


Ala 










245 








Gly 


Leu 


Arg 


Glu 


He 


His Ser 


Asp 


He 








260 








265 


Ala 


Phe 


Ser 


Gin 


Gin Arg Gin 


Pro 


Asn 






275 








280 




Asp 


Asn 


Gly 


Gly 


Cys 


Ser His 


Leu 


Cys 




290 








295 






Phe 


Tyr 


Gin 


Cys 


Ala 


Cys Pro 


Thr 


Gly 


305 










310 






Lys 


Thr 


Cys 


Lys 


Asp 


Gly Ala 


Thr 


Glu 










325 








Thr 


Asp 


Leu 


Arg 


Arg 


He Ser 


Leu 


Asp 








340 








345 


Val 


Leu 


Gin 


Leu 


Glu 


Asp He 


Arg 


His 






355 








360 




Pro 


Val 


Glu 


Gly 


Tyr 


He Tyr 


Trp 


Thr 




370 








375 






Arg 


Arg 


Ser 


Phe 


He 


Asp Gly 


Ser 


Gly 


385 










390 






Gin 


lie 


Ala 


His 


Pro 


Asp Gly 


He 


Ala 










405 








Leu 


Tyr 


Trp 


Thr 


Asp Thr Gly 


Thr 


Asp 








420 








425 


Asn 


Gly 


Thr 


Met 


Arg 


Lys He 


Leu 


He 






435 








440 




Arg 


Ala 


He 


Val 


Leu 


Asp Pro 


Met 


Val 




450 








455 






Trp 


Gly 


Glu 


He 


Pro 


Lys He 


Glu 


Arg 


465 










470 






Arg 


Val 


Val 


Leu 


Val 


Asn Thr 


Ser 


Leu 










485 








Leu 


Asp 


Tyr 


Asp 


Glu Gly Lys 


He 


Tyr 








500 








505 


Lys 


lie 


Glu 


Val 


Met 


Asn Thr 


Asp 


Gly 






515 








520 




Glu 


Asp 


Lys 


He 


Pro 


His He 


Phe 


Gly 




530 








535 






Val 


Tyr 


Trp 


Thr 


Asp 


Trp Gin 


Arg 


Arg 




PCT/US02/14877 



125 



Asp 


Gin 


Pro 


Arg 


Ala 


He 


Ala 






140 










Trp 


Thr 


Asp 


Trp 


Gly 


Glu 


Val 




155 










160 


Gly 


Ser 


Ser 


Arg 


Phe 


He 


He 


170 










175 




Gly 


Leu 


Thr 


Leu 


Asp 


Tyr 


Glu 










190 






Lys 


Leu 


Asn 


Phe 


He 


His 


Lys 








205 








Ala 


Val 


Val 


Lys 


Gly 


Ser 


Leu 






220 










Glu 


Asp 


He 


Leu 


Tyr 


Trp 


Thr 




235 










240 


Cys 


Asn 


Lys 


Tyr 


Thr 


Gly 


Glu 


250 










255 




Phe 


Ser 


Pro 


Met 


Asp 


He 


His 










270 






Ala 


Thr 


Asn 


Pro 


Cys 


Gly 


He 








285 








Leu 


Met 


Ser 


Pro 


Val 


Lys 


Pro 






300 










Val 


Lys 


Leu 


Leu 


Glu 


Asn 


Gly 




315 










320 


Leu 


Leu 


Leu 


Leu 


Ala 


Arg 


Arg 


330 










335 




Thr 


Pro 


Asp 


Phe 


Thr 


Asp 


He 










350 






Ala 


He 


Ala 


He 


Asp 


Tyr 


Asp 








365 








Asp 


Asp 


Glu 


Val 


Arg 


Ala 


He 






380 










Ser 


Gin 


Phe 


Val 


Val 


Thr 


Ala 




395 










400 


Val 


Asp 


Trp 


Val 


Ala 


Arg 


Asn 


410 










415 




Arg 


He 


Glu 


Val 


Thr 


Arg 


Leu 










430 






Ser 


Glu 


Asp 


Leu 


Glu 


Glu 


Pro 








445 








Gly 


Tyr 


Met 


Tyr 


Trp 


Thr 


Asp 






460 










Ala 


Ala 


Leu 


Asp 


Gly 


Ser 


Asp 




475 










480 


Gly 


Trp 


Pro 


Asn 


Gly 


Leu 


Ala 


490 










495 




Trp 


Gly 


Asp 


Ala 


Lys 


Thr 


Asp 










510 






Thr 


Gly 


Arg 


Arg 


Val 


Leu 


Val 








525 








Phe 


Thr 


Leu 


Leu 


Gly 


Asp 


Tyr 






540 










Ser 


He 


Glu 


Arg 


Val 


His 


Lys 
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545 550 555 560 

Arg Ser Ala Glu Arg Glu Val He He Asp Gin Leu Pro Asp Leu Met 

565 570 575 

Gly Leu Lys Ala Thr Asn Val His Arg Val He Gly Ser Asn Pro Cys 

580 585 590 

Ala Glu Glu Asn Gly Gly Cys Ser His Leu Cys Leu Tyr Arg Pro Gin 

595 600 605 

Gly Leu Arg Cys Ala Cys Pro He Gly Phe Glu Leu He Ser Asp Met 

610 615 620 

Lys Thr Cys He Val Pro Glu Ala Phe Leu Leu Phe Ser Arg Arg Ala 
625 ~ 630 635 640 

Asp He Arg Arg He Ser Leu Glu Thr Asn Asn Asn Asn Val Ala He 

645 650 655 

Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Thr 

660 665 670 

Asp Asn Arg He Tyr Trp Thr Asp He Ser Leu Lys Thr He Ser Arg 

675 680 685 

Ala Phe Met Asn Gly Ser Ala Leu Glu His Val Val Glu Phe Gly Leu 

690 695 700 

Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Leu Gly Lys Asn Leu Tyr 
705 710 715 720 

Trp Ala Asp Thr Gly Thr Asn Arg He Glu Val Ser Lys Leu Asp Gly 

725 730 735 

Gin His Arg Gin Val Leu Val Trp Lys Asp Leu Asp Ser Pro Arg Ala 

740 745 750 

Leu Ala Leu Asp Pro Ala Glu Gly Phe Met Tyr Trp Thr Glu Trp Gly 

755 760 765 

Gly Lys Pro Lys He Asp Arg Ala Ala Met Asp Gly Ser Glu Arg Thr 

770 775 780 

Thr Leu Val Pro Asn Val Gly Arg Ala Asn Gly Leu Thr He Asp Tyr 
785 790 795 800 

Ala Lys Arg Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Leu He Glu 

805 810 815 

Ser Ser Asn Met Leu Gly Leu Asn Arg Glu Val He Ala Asp Asp Leu 

820 825 830 

Pro His Pro Phe Gly Leu Thr Gin Tyr Gin Asp Tyr He Tyr Trp Thr 

835 840 845 

Asp Trp Ser Arg Arg Ser He Glu Arg Ala Asn Lys Thr Ser Gly Gin 

850 855 860 

Asn Arg Thr He He Gin Gly His Leu Asp Tyr Val Met Asp He Leu 
865 870 875 880 

Val Phe His Ser Ser Arg Gin Ser Gly Trp Asn Glu Cys Ala Ser Ser 

885 890 895 

Asn Gly His Cys Ser His Leu Cys Leu Ala Val Pro Val Gly Gly Phe 

900 905 910 

Val Cys Gly Cys Pro Ala His Tyr Ser Leu Asn Ala Asp Asn Arg Thr 

915 920 925 

Cys Ser Ala Pro Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala He 

930 935 940 

Asn Arg Met Val He Asp Glu Gin Gin Ser Pro Asp He He Leu Pro 
945 ~ 950 955 960 

He His Ser Leu Arg Asn Val Arg Ala He Asp Tyr Asp Pro Leu Asp 

965 970 975 

Lys Gin Leu Tyr Trp He Asp Ser Arg Gin Asn Met He Arg Lys Ala 
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Gin Glu Asp 
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Ser Gin Gly Phe 


Thr Val 


Val 
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Val 


Pro 




995 
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Ser 
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Glu He 


Gin 
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Tyr Asp 
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Ser 


He 


Asp 


He 


Tyr 




1010 
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Ser 


Arg Tyr 


He 


Tyr Trp 


Thr 


Cys 


Glu Ala 


Thr 


Asn 


Val 


He 


Asn 


Val 


1025 




1030 






1035 








104i 


Thr 


Arg Leu 


Asp 


Gly Arg 


Ser 


Val 


Gly Val 


Val 


Leu 


Lys 


Gly 


Glu 


Gin 








1045 






1050 








1055 


Asp 


Arg Pro 


Arg 


Ala He 


Val 


Val 


Asn Pro 


Glu 


Lys 


Gly Tyr 


Met 


Tyr 






1060 






1065 








1070 




Phe 


Thr Asn 


Leu 


Gin Glu 


Arg 


Ser 


Pro Lys 


He 


Glu 


Arg 


Ala 


Ala 


Leu 



1075 1080 1085 



Asp Gly Thr Glu Arg Glu Val Leu Phe Phe Ser Gly Leu Ser Lys Pro 

1090 1095 1100 

He Ala Leu Ala Leu Asp Ser Arg Leu Gly Lys Leu Phe Trp Ala Asp 
1105 1110 1115 112' 

Ser Asp Leu Arg Arg He Glu Ser Ser Asp Leu Ser Gly Ala Asn Arg 

1125 1130 1135 

He Val Leu Glu Asp Ser Asn He Leu Gin Pro Val Gly Leu Thr Val 

1140 1145 1150 

Phe Glu Asn Trp Leu Tyr Trp He Asp Lys Gin Gin Gin Met He Glu 

1155 1160 1165 

Lys He Asp Met Thr Gly Arg Glu Gly Arg Thr Lys Val Gin Ala Arg 

1170 1175 1180 

He Ala Gin Leu Ser Asp He His Ala Val Lys Glu Leu Asn Leu Gin 
1185 1190 1195 120 

Glu Tyr Arg Gin His Pro Cys Ala Gin Asp Asn Gly Gly Cys Ser His 

1205 1210 1215 

He Cys Leu Val Lys Gly Asp Gly Thr Thr Arg Cys Ser Cys Pro Met 

1220 1225 1230 

His Leu Val Leu Leu Gin Asp Glu Leu Ser Cys Gly Glu Pro Pro Thr 

1235 1240 1245 

Cys Ser Pro Gin Gin Phe Thr Cys Phe Thr Gly Glu He Asp Cys He 

1250 1255 * 1260 

Pro Val Ala Trp Arg Cys Asp Gly Phe Thr Glu Cys Glu Asp His Ser 
1265 1270 1275 128i 

Asp Glu Leu Asn Cys Pro Val Cys Ser Glu Ser Gin Phe Gin Cys Ala 

1285 1290 1295 

Ser Gly Gin Cys He Asp Gly Ala Leu Arg Cys Asn Gly Asp Ala Asn 

1300 1305 1310 

Cys Gin Asp Lys Ser Asp Glu Lys Asn Cys Glu Val Leu Cys Leu He 

1315 1320 1325 

Asp Gin Phe Arg Cys Ala Asn Gly Gin Cys He Gly Lys His Lys Lys 

1330 1335 1340 

Cys Asp His Asn Val Asp Cys Ser Asp Lys Ser Asp Glu Leu Asp Cys 
1345 1350 1355 136< 

Tyr Pro Thr Glu Glu Pro Ala Pro Gin Ala Thr Asn Thr Val Gly Ser 

1365 1370 1375 

Val He Gly Val He Val Thr He Phe Val Ser Gly Thr Val Tyr Phe 

1380 1385 1390 

He Cys Gin Arg Met Leu Cys Pro Arg Met Lys Gly Asp Gly Glu Thr 

1395 1400 1405 

Met Thr Asn Asp Tyr Val Val His Gly Pro Ala Ser Val Pro Leu Gly 
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1410 1415 1420 

Tyr Val Pro His Pro Ser Ser Leu Ser Gly Ser Leu Pro Gly Met Ser 
1425 1430 1435 1440 

Arg Gly Lys Ser Met He Ser Ser Leu Ser He Met Gly Gly Ser Ser 

1445 1450 1455 

Gly Pro Pro Tyr Asp Arg Ala His Val Thr Gly Ala Ser Ser Ser Ser 

1460 1465 1470 

Ser Ser Ser Thr Lys Gly Thr Tyr Phe Pro Ala He Leu Asn Pro Pro 

1475 1480 1485 

Pro Ser Pro Ala Thr Glu Arg Ser His Tyr Thr Met Glu Phe Gly Tyr 

1490 1495 1500 

Ser Ser Asn Ser Pro Ser Thr His Arg Ser Tyr Ser Tyr Arg Pro Tyr 
1505 1510 1515 1520 

Ser Tyr Arg His Phe Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val 

1525 1530 1535 

Cys Asp Ser Asp Tyr Ala Pro Ser Arg Arg Met Thr Ser Val Ala Thr 

1540 1545 1550 

Ala Lys Gly Tyr Thr Ser Asp Leu Asn Tyr Asp Ser Glu Pro Val Pro 

1555 1560 1565 

Pro Pro Pro Thr Pro Arg Ser Gin Tyr Leu Ser Ala Glu Glu Asn Tyr 

1570 1575 1580 

Glu Ser Cys Pro Pro Ser Pro Tyr Thr Glu Arg Ser Tyr Ser His His 
1585 1590 1595 1600 

Leu Tyr Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 

<210> 765 
<211> 207 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> cytoplasmic domain and portion of transmembrane 
domain 

<400> 765 

Arg Val Val Cys Gin Arg Tyi 

1 5 
Glu Tyr Val Ser Gly Thr Pre 
20 

Gly Gly Ser Gin His Gly Pre 
35 

Met Met Ser Ser Val Ser Le\ 
50 55 
Tyr Asp Arg Asn His Val Th3 
65 " 70 

Thr Lys Ala Thr Leu Tyr Pre 
85 

Ala Thr Asp Pro Ser Leu Ty: 
100 

He Pro Ala Thr Val Arg Pr< 
115 

Ala Pro Pro Thr Thr Pro Cyi 

130 135 140 
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He 
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45 








Met 
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Val 


Pro 
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